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^FARTM^KfT OF COMMERCE PATENT AND TRADEMARK OFFICE 




^ PCfMi. S £«zAUG 2001 



TRANSMITTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



'RNEY'S DOCKET NUMBER 

1781-0163P 



U.S. APPLICATION NO. (If known, see 37 CFR 1.5) 

09/94f3v898 



INTERNATIONAL APPLICATION NO. 

PAC/SG99/00079 



INTERNATIONAL FILING DATE 

July 16, 1999 



PRIORITY DATE CLAIMED 

February 18, 1999 



TITLE OF INVENTION 

CHIMERIC GENE CONSTRUCTS FOR GENERATION OF FLUORESCENT TRANSGENIC ORNAMENTAL FISH 



APPLICANT(S) FOR DO/EO/US 

GONG, Zhiyuan; HE, Jiangyan; JU, Bensheng; LAM, Toong Jin; XU, Yanf ei ; YAN, Tie 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information 



3.M 



This is a FIRST submission of items concerning a filing under 35 U.S.C. 371. 

This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371. 
This express request to begin national examination procedures (35 U.S.C. 371(f)) at any time rather than delay 
examination until the expiration of the applicable time limit set in 35 U.S.C. 371(b) and PCT Articles 22 and 39 (1). 
The US has been elected by the expiration of 19 months from the priority date (Article 31). 
5. [X] A copy of the International Application as filed (35 U.S.C. 371(c)(2)) 

a. [^] is transmitted herewith (required only if not transmitted by the International Bureau). 

b. has been transmitted by the International Bureau. WO 00/49150 ^ 
is not required, as the application was filed in the United States Receiving Office (RO/US). 

An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)). 

a. [X] is transmitted herewith. 

b. \^] has been previously submitted under 35 U.S.C. 154(d)(4) 

Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3)). 

a. \^\ are transmitted herewith (required only if not transmitted by the International Bureau). 

b. | | have been transmitted by the International Bureau, 
have not been made; however, the time limit for making such amendments has NOT expired. 

d. DKI have not been made and will not be made. 

An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)). 
An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)). 

10. □ An English language translation of the annexes of the International Preliminary Examination Report under PCT Article 36 

(35 U.S.C. 371(c)(5)). 

Items 11. to 20. below concern document(s) or information included: 

11. ^ An Information Disclosure Statement under 37 CFR 1.97 and 1 .98. /International Search Report (PCT/ISA/210). 

12. □ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included. 

13. [X] A FIRST preliminary amendment. 

14. Q A SECOND or SUBSEQUENT preliminary amendment. 

15. □ A substitute specification. 

16. □ A change of power of attorney and/or address letter. 

17. O A computer-readable form of the sequence listing in accordance with PCT Rule 13ter.2 and 35 U.S.C. 1.821-1.825. 

18. □ A second copy of the published international application under 35 U.S.C. 154(d)(4). 

19. □ A second copy of the English language translation of the international application under 35 U.S.C. 154(d)(4). 

20. [X] Other items or iriforrnation: 

PCT/IPEA/409 

Fifteen (15) sheets of formal drawings 
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U.S. APPLICATION NO (if known, sec 37 CFR 1 5) 



INTERNATIONAL APPLICATION NO 

PCT/SG99/00079 



ATTORNEY'S DOCKET NUMBER 
1781-0163P 



21. IX The following fees are submitted: 
BASIC NATIONAL FEE (37 CFR 1.492(a)(l)-(5): 
Neither international preliminary examination fee (37 CFR 1.482) 
nor international search fee (37 CFR 1.445(a)(2)) paid to USPTO 

and International Search Report not prepared by the EPO or JPO $1,000.00 



International preliminary examination fee (37 CFR 1 .482) not paid to 
USPTO but International Search Report prepared by the EPO or JPO . 



International preliminary examination fee (37 CFR 1 .482) not paid to USPTO 
but international search fee (37 CFR 1.445(a)(2)) paid to USPTO 



International preliminary examination fee (37 CFR 1.482) paid to USPTO 
but all claims did not satisfy provisions of PCT Article 33(1 )-(4) 



International preliminary examination fee (37 CFR 1 .482) paid to USPTO 
and all claims satisfied provisions of PCT Article 33(l)-(4) 

ENTER APPROPRIATE BASIC FEE AMOUNT 



$860.00 



$710.00 



$690.00 



$100.00 



CALCULATIONS 



PTO USE ONLY 



1,000.00 



Surcharge of $130.00 for furnishing the oath or declaration later than O 20 [X] 30 
months from the earliest claimed priority date (37 CFR 1 .492(e)). 



130.00 



1CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



TotarClaims 



39 



20 = 



19 



X $18.00 



342.00 



Independent Claims 



3 



X $80.00 



400.00 



MUICTPLE DEPENDENT CLAIM(S) (if applicable) 



Yes 



$270.00 



270.00 



TOTAL OF ABOVE CALCULATIONS = 



£<] Applicant claims small entity status. See 37 CFR 1.27. The fees indicated above ; 
reduced by 1/2. 



2,142.00 



-1,071.00 



SUBTOTAL = 



1,071.00 



Processing fee of $130.00 for furnishing the English translation later than 
mon|hV from the earliest claimed priority date (37 CFR 1 .492(f)). 



□20 D30 

+ 



0.00 



TOTAL NATIONAL FEE = 



1,071.00 



Fee fqr recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.3 1). $40.00 per property 



0.00 



TOTAL FEES ENCLOSED = 



1,071.00 



Amount to be: 
refunded 



$ 



charged $ 



a. S A check in the amount of $ 1,071.00 to cover the above fees is enclosed. 



in the amount of $ 



to cover the above fees. 



b. O Please charge my Deposit Account. No. 

A duplicate copy of this sheet is enclosed. 

c. [>3 The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any 

overpayment to Deposit Account No. 02-2448 . 

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137(a) or (b)) must be filed and granted to restore the application to pending status. 

Send all correspondence to: 

Birch, Stewart, Kolasch & Birch, LLP or Customer No. 2292 
P.O. Box 747 

Falls Church, VA 22040-0747 
(703)205-8000 

Date: August 17. 2001 
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PATENT 
1781-0163P 

IN THE U.S. PATENT AND TRADEMARK OFFICE 

GOGN, Zhiyuan et al . Conf . : 

No. : PCT/SG99/00079 

New Group : 

August 17, 2 001 Examiner: 

CHIMERIC GENE CONSTRUCTS FOR 
GENERATION OF FLUORESCENT 
TRANSGENIC ORNAMENTAL FISH 

PRELIMINARY AMENDMENT 

BOX PATENT APPLICATION 

Assistant Commissioner for Patents August 17 2001 

Washington, DC 20231 

Sir: 

The following Preliminary Amendments and Remarks are 
respectfully submitted in connection with the above -identified 
application. 

AMENDMENTS 

IN THE SPECIFICATION: 

Please amend the specification as follows: 

Before line 1, insert --This application is the national 
phase under 35 U.S.C. § 371 of PCT International Application No. 
PCT/SG99/00079 which has an International filing date of July 16, 
1999, which designated the United States of America 
and was published in English. -- 



Applicant : 
Int ' 1 . Appl . 
Appl . No . : 
Filed: 
For: 




Docket No. 1781-0163P 

REMARKS 

The specification has been amended to provide a cross- 
reference to the previously filed International Application. 

Entry of the above amendments is earnestly solicited. An 
early and favorable first action on the merits is earnestly 
solicited. 

If necessary, the Commissioner is hereby authorized in this, 
concurrent, and future replies, to charge payment or credit any 
overpayment to Deposit Account No. 02-2448 for any additional 
fees required under 37 C.F.R. § 1.16 or under 37 C.F.R. § 1.17; 
particularly, extension of time fees. 

Respectfully submitted, 

BIRCH, STEWART, KOLAS CH & BIRCH, LLP 




By_ 

Gerald M. Murphy, Jr., #28,977 
P.O. Box 747 

GMM/rem Falls Church, VA 22040-0747 

1781-0163P (703) 205-8000 

Attachment : 
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BOX SEQUENCE 

PATENT 
1781-0163P 



IN THE U.S. PATENT AND TRADEMARK OFFICE 

GONG et al . Conf . : 5940 

09/913,898 Group: To Be Assigned 

August 17, 2001 Examiner: TO BE ASSIGNED 

CHIMERIC GENE CONSTRUCTS FOR GENERATION 
OF FLUORESCENT TRANSGENIC ORNAMENTAL 
FISH 

AMENDMENT 



Assistant Commissioner for Patents 
Washington, DC 20231 

Sir: 



November 27, 2001 



In response to the Notification of Missing Requirements Under 
15 U.S.C. 371 in the United States Designated/Elected Office 
(DO/EO/US) mailed October 30, 2001, the following amendments and 
remarks are respectfully submitted in connection with the above- 
identified application . 



In the Specification : 

Please delete pages 24-46 of the specification containing the 
Sequence Listing. Please renumber the remaining pages of the 
specification, beginning with the "References", consecutively from 



Docket No. 1781-0163P 

page 24. Please insert the Substitute Sequence Listing enclosed 
herewith immediately after the abstract. 

In the References : 

Please renumber the pages containing the references, pages 47- 
51, to 24-28, respectfully. 

In the Claims : 

Please renumber the pages containing the claims, pages 52-53, 
to 29-30, respectfully. 

In the Abstract : 

Please renumber the page containing the abstract, page 54, to 

31. 

REMARKS 

Enclosed herewith in full compliance to 37 C.F.R. §§1.821- 
1.825 is a Substitute Sequence Listing to be inserted into the 
specification as indicated above. The Substitute Sequence Listing 
in no way introduces new matter into the specification. Also 
submitted herewith in full compliance to 37 C.F.R. §§1.821-1.825 
is a disk copy of the Substitute Sequence Listing. The disk copy 



Docket No. 1781-0163P 

of the Sequence Listing, file "2001-11-12 178 1- 1 63P . txt " , is 
identical to the paper copy, except that it lacks formatting. 

The amendments to the Specification, References, Claims, and 
Abstract are merely to correct page numbers. No new matter is 
introduced by these amendments. 

If necessary, the Commissioner is hereby authorized in this, 
concurrent, and future replies, to charge payment or credit any 
overpayment to Deposit Account No. 02-2448 for any additional fees 
required under 37 C.F.R. § 1.16 or under 37 C.F.R. § 1.17; 
particularly, extension of time fees. 

Respectfully submitted, 

BIRCH, STEWART, KOLASCH & BIRCH, LLP 

By ^jfu^X /3u^iAjf-^^59-o^ 

y^iJGerald M. Murphy, /Jr., #28, 977 
C/P.O. Box 747 

Falls Church, VA 22040-0747 
GMM/LPS (703) 205-8000 

Attachments: Disk Copy of Substitute Sequence Listing 

Paper Copy of Substitute Sequence Listing 
Copy of Notification 

(Rev. 03/27/01! 
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SEQUENCE LISTING 




GONG, Zhiyuan 
LAM, Toong Jin 
JU, Bensheng 
XU, Yanfei 
HE, Jiangyan 
YAN, Tie 

^ L -"<120> CHIMERIC GENE CONSTRUCTS FOR GENERATION OF FLUORESCENT 
TRANSGENIC ORNAMENTAL FISH 



<130> 1781-0163P 



<140> 09/913, ! 
<141> 2001-08-17 



<150> PCT/SG99/00079 
<151> 1999-01-14 

<160> 24 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 2480 

<212> UNA 

<213> Danio rerio 



<220> 
<221> CDS 
|<222> (90) 



. (1586) 



<220> 

|<221> primer_bind 

! <222> (66) . . (85) 

<223> CK2 

<220> 

<221> primer_bind 
<222> (97) . .Tl20) 
<223> CK1 

<220> 

<221> polyA_signal 
<222> (2446) . . (2451) 

<400> 1 

ctctcctttg tgagcaacct cctccactca ctcctctctc agagagcact ctcgtacctc 60 



cttctcagca actcaaagac acaggcatc atg tea acc agg tct ate tct tac 

Met Ser Thr Arg Ser lie Ser Tyr 
1 5 



113 



tec age ggt ggc tec ate agg agg ggc tac acc age cag tea gec tat 161 
Ser Ser Gly Gly Ser He Arg Arg Gly Tyr Thr Ser Gin Ser Ala Tyr 
10 15 20 



m 



gca gta cct gcc ggc tct acc agg atg age tea gtg acc agt gtc agg 
Ala Val Pro Ala Gly Ser Thr Arg Met Ser Ser Val Thr Ser Val Arg 
25 30 35 40 



209 



aga tct ggt gtg ggt gcc age cca ggc ttc ggt gcc ggt ggc age tac 
Arg Ser Gly Val Gly Ala Ser Pro Gly Phe Gly Ala Gly Gly Ser Tyr 
45 50 55 



257 



age ttt age age age age atg ggt gga ggc tat gga agt ggt ctt ggt 
Ser Phe Ser Ser Ser Ser Met Gly Gly Gly Tyr Gly Ser Gly Leu Gly 
60 65 70 



305 



gga ggt etc ggt ggg ggc atg ggc ttt cgt tgc ggg ctt cct ate aca 
Gly Gly Leu Gly Gly Gly Met Gly Phe Arg Cys Gly Leu Pro lie Thr 
75 80 85 



353 



get gta act gtc aac cag aac ctg ttg gcc ccc tta aac ctg gaa ate 
Ala Val Thr Val Asn Gin Asn Leu Leu Ala Pro Leu Asn Leu Glu lie 
90 95 100 



401 



gac ccc aca att caa get gtc cgc act tea gag aaa gag cag att aag 
Asp Pro Thr lie Gin Ala Val Arg Thr Ser Glu Lys Glu Gin lie Lys 
105 110 115 120 



449 



acc ttc aac aac cgc ttc get ttc etc ate gac aaa gtg cgc ttc ctg 
Thr Phe Asn Asn Arg Phe Ala Phe Leu lie Asp Lys Val Arg Phe Leu 
125 130 135 



497 



gaa cag cag aac aag atg ctt gag acc aaa tgg agt ctt etc caa gaa 
Glu Gin Gin Asn Lys Met Leu Glu Thr Lys Trp Ser Leu Leu Gin Glu 
140 145 150 



545 



cag aca acc aca cgt tec aac ate gat gcc atg ttt gag gca tac ate 
Gin Thr Thr Thr Arg Ser Asn lie Asp Ala Met Phe Glu Ala Tyr lie 
155 160 . 165 



593 



tct aac ctg cgc aga cag etc gat gga ctg gga aat gag aag atg aag 
Ser Asn Leu Arg Arg Gin Leu Asp Gly Leu Gly Asn Glu Lys Met Lys 
170 175 180 



641 



ctg gag gga gag ctg aag aac atg cag ggc ctg gtt gag gac ttc aag 
Leu Glu Gly Glu Leu Lys Asn Met Gin Gly Leu Val Glu Asp Phe Lys 
185 190 195 200 



689 



aac aag tac gag gat gag ate aac aag cgt get tec gta gag aat gag 
Asn Lys Tyr Glu Asp Glu lie Asn Lys Arg Ala Ser Val Glu Asn Glu 
205 210 215 



737 



ttt gtc ctg etc aag aag gat gtt gat gca gcc tac atg aac aag gtt 
Phe Val Leu Leu Lys Lys Asp Val Asp Ala Ala Tyr Met Asn Lys Val 
220 225 230 



785 



gag ctt gaa gcc aag gtt gat get ctt cag gat gag ate aac ttc etc 
Glu Leu Glu Ala Lys Val Asp Ala Leu Gin Asp Glu He Asn Phe Leu 
235 240 245 



833 



2 



agg gca gtc tac gag get gaa etc egg gag etc cag tct cag ate aag 881 

Arg Ala Val Tyr Glu Ala Glu Leu Arg Glu Leu GLn Ser Gin lie Lys 
250 255 260 

gac aca tct gtt gtt gta gaa atg gac aac age aga aac ctg gat atg 929 

Asp Thr Ser Val Val Val Glu Met Asp Asn Ser Arg Asn Leu Asp Met 

265 270 275 280 

gac tec ate gtg get gaa gtt cgc get cag tat gaa gac ate gec aac 977 

Asp Ser lie Val Ala Glu Val Arg Ala Gin Tyr Glu Asp lie Ala Asn 
285 290 295 

cgc age cgt gec gag gca gag age tgg tac aaa cag aag ttt gag gag 1025 

Arg Ser Arg Ala Glu Ala Glu Ser Trp Tyr Lys Gin Lys Phe Glu Glu 
300 305 310 

atg cag age acc get ggt cag tat ggt gat gac etc cgc tea aca aag 1073 

Met Gin Ser Thr Ala Gly Gin Tyr Gly Asp Asp Leu Arg Ser Thr Lys 
315 320 325 

get gag att get gaa etc aac cgc atg ate gec cgc ctg cag aac gag 1121 

Ala Glu lie Ala Glu Leu Asn Arg Met lie Ala Arg Leu Gin Asn Glu 
330 335 340 

ate gat get gtc aag gca cag cgt gec aac ttg gag get cag att get 1169 

■ He Asp Ala Val Lys Ala Gin Arg Ala Asn Leu Glu Ala Gin He Ala 

345 350 355 360 

gag get gaa gag cgt gga gag ctg gca gtg aag gat gec aag etc cgc 1217 

Glu Ala Glu Glu Arg Gly Glu Leu Ala Val Lys Asp Ala Lys Leu Arg 
365 370 ' 375 

ate agg gag ctg gag gaa get ctt cag agg gec aag caa gac atg gec 1265 

He Arg Glu Leu Glu Glu Ala Leu Gin Arg Ala Lys Gin Asp Met Ala 
380 385 390 

cgc cag gtc cgc gag tac cag gag etc atg aac gtc aaa ttg get ctg 1313 

Arg Gin Val Arg Glu Tyr Gin Glu Leu Met Asn Val Lys Leu Ala Leu 
395 400 405 

gac att gag ate gec acc tac agg aaa ctg ttg gaa gga gag gag age 1361 

Asp He Glu He Ala Thr Tyr Arg Lys Leu Leu Glu Gly Glu Glu Ser 
410 415 420 

aga ctg tec age ggt gga get caa get acc att cat gtt cag cag acc 1409 

Arg Leu Ser Ser Gly Gly Ala Gin Ala Thr He His Val Gin Gin Thr 

425 430 435 440 

tec gga ggt gtt tea tct ggt tat ggt ggt age ggc tct ggt ttc ggc 1457 

Ser Gly Gly Val Ser Ser Gly Tyr Gly Gly Ser Gly Ser Gly Phe Gly 
445 450 455 

tac age agt ggc ttc age agt ggt ggg tea gga tac ggt agt gga tea 1505 

Tyr Ser Ser Gly Phe Ser Ser Gly Gly Ser Gly Tyr Gly Ser Gly Ser 
460 465 470 

gga ttc ggt tct gga tea ggg tat ggt gga ggc tec ate age aaa acc 1553 
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t 



Gly Phe Gly Ser Gly Ser Gly Tyr Gly Gly Gly Ser lie Ser Lys Thr 

475 480 485 

■agt gtc acc acc gtc age agt aaa cgc tat taa ggagaagece gcccaaaccc 1606 

Ser Val Thr Thr Val Ser Ser Lys Arg Tyr 

490 ■ 495 



ccagccgaca 
cgact caaac 
gatgtggacg 
tagttacccg 
ccaactacac 
tetgeactga 
ctatggtttc 
tttgtgctgt 
gctatggctg 
taactgatga 
ctgttacagc 
gtactttccg 
get tccagag 
atgttcaacc 
ataaatctga 



cagtt tccaa 
catctatggt 
atgattttta 
tatcatgeat 
taaaacgata 
tatgtacagg 
actggatcag 
tacaagctcc 
tcagagttgg 
caattgettg 
aaaaaacagg 
tgttggaagt 
catattttac 
agatttattt 
gaccacaaaa 



ccttccttac 
gctatatttt 
aagtacaaaa 
ctcctgtctg 
ccaggaagaa 
gaaagtgaga 
gggtgtgcta 
ctgctgtagt 
tttacct att 
tgggctacta 
cacaatggga 
ggtgactgta 
at ccccggtt 
ggtattgatt 
aaaaaaaaaa 



ctgeaactag 
agecagacag 
taagttttag 
gtggtgtcac 
tcgtgctcca 
cacatagaaa 
tacaagtt cc 
tttgetgact 
ttctataaaa 
atgtccagtt 
tttatgtgga 
ctgccttaca 
ataaatggaa 
gacgagacac 
aaaa 



atcccttctg 
ctgtcccctg 
attgtt ctgt 
tgecat tt ta 
agecactgaa 
ccactgtaac 
tgaatgtctt 
aatctgactt 
tgtatatggc 
acctcacatt 
ccatccct cc 
cattcccct g 
aatgcaagaa 
caactt gaaa 



aaccttct ta 
t taat gagga 
gtgttgatgg 
aatcat caac 
tagtcttatt 
etaegtagta 
gtttgaatgt 
ttgtcatttt 
agtcagccaa 
caagggagat 
ttaaccttgt 
tattcaactg 
aactgaaaca 
ttt gaataca 



1666 
1726 
1786 
1846 
1906 
1966 
2026 
2086 
2146 
2206 
2266 
2326 
2386 
2446 
2480 



<210> 2 

<211> 498 

<212> PRT 

<213> Danio 

<400> 2 
Met Ser Thr 
1 

Gly Tyr Thr 



rerio 



Arg Ser 
5 

Ser Gin 
20 



lie Ser Tyr Ser 



Ser Ala 



Met Ser Ser Val Thr Ser Val 
35 



Gly Phe Gly 
50 

Gly Gly Tyr 
65 

Phe Arg Cys 



Leu Ala Pro 



Thr Ser Glu 
115 

Leu lie Asp 
130 



Ala Gly 
Gly Ser 



Gly Leu 
85 

Leu Asn 
100 



Gly Ser 
55 

Gly Leu 
70 



Tyr Ala 

25 

Arg Arg 
40 

Tyr Ser 



Ser Gly Gly Ser 
10 

Val Pro Ala Gly 



Ser Gly Val Gly 
45 

Phe Ser Ser Ser 
60 



lie Arg Arg 

15 

Ser Thr Arg 
30 

Ala Ser Pro 



Ser Met Gly 



Gly Gly Gly Leu Gly Gly 
75 



Pro lie Thr Ala 



Leu Glu 



Lys Glu Gin lie 

Lys Val Arg Phe 
135 



lie Asp 
105 

Lys Thr 
120 



Val Thr Val Asn 
90 

Pro Thr He Gin 



Phe Asn Asn Arg 
125 



Gly Met Gly 
80 

Gin Asn Leu 
95 

Ala Val Arg 
110 

Phe Ala Phe 



Leu Glu Gin Gin Asn Lys Met Leu Glu 
140 
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Thr Lys Trp Ser Leu Leu Gin Glu Gin Thr Thr Thr Arg Ser Asn He 
145 150 155 160 

Asp Ala Met Phe Glu Ala Tyr He Ser Asn Leu Arg Arg Gin Leu Asp 
165 170 175 

Gly Leu Gly Asn Glu Lys Met Lys Leu Glu Gly Glu Leu Lys Asn Met 
180 185 190 

Gin Gly Leu Val Glu Asp Phe Lys Asn Lys Tyr Glu Asp Glu He Asn 
195 200 205 

Lys Arg Ala Ser Val Glu Asn Glu Phe Val Leu Leu Lys Lys Asp Val 
210 215 220 

Asp Ala Ala Tyr Met Asn Lys Val Glu Leu Glu Ala Lys Val Asp Ala 
225 230 235 240 

Leu Gin Asp Glu He Asn Phe Leu Arg Ala Val Tyr Glu Ala Glu Leu 
245 250 255 

Arg Glu Leu Gin Ser Gin He Lys Asp Thr Ser Val Val Val Glu Met 
260 265 270 

Asp Asn Ser Arg Asn Leu Asp Met Asp Ser He Val Ala Glu Val Arg 
275 280 285 

.Ala Gin Tyr Glu Asp He Ala Asn Arg Ser Arg Ala Glu Ala Glu ' Ser 
290 295 300 

Trp Tyr Lys Gin Lys Phe Glu Glu Met Gin Ser Thr Ala Gly Gin Tyr 
305 310 315 320 

Gly Asp Asp Leu Arg Ser Thr Lys Ala Glu He Ala Glu Leu Asn Arg 
325 330 335 

Met He Ala >Arg Leu Gin Asn Glu He Asp Ala Val Lys Ala Gin Arg 
340 345 350 

Ala Asn Leu Glu Ala Gin He Ala Glu Ala Glu Glu Arg Gly Glu Leu 
355 360 365 

Ala Val Lys Asp Ala Lys Leu Arg He Arg Glu Leu Glu Glu Ala Leu 
370 375 380 

Gin Arg Ala Lys Gin Asp Met Ala Arg Gin Val Arg Glu Tyr Gin Glu 
385 390 395 400 

Leu Met Asn Val Lys Leu Ala Leu Asp He Glu He Ala Thr Tyr Arg 
405 410 415 

Lys Leu Leu Glu Gly Glu Glu Ser Arg Leu Ser Ser Gly Gly Ala Gin 
420 425 430 

Ala Thr He His Val Gin Gin Thr Ser Gly Gly Val Ser Ser Gly Tyr 
435 440 445 



5 




Gly Gly Ser Gly Ser Gly Phe Gly Tyr Ser Ser Gly Phe Ser Ser Gly 
450 455 460 

Gly Ser Gly Tyr Gly Ser Gly Ser Gly Phe Gly Ser Gly Ser Gly Tyr 
465 470 475 480 

Gly Gly Gly Ser He Ser Lys Thr Ser Val Thr Thr Val Ser Ser Lys 
485 490 1 495 

Arg Tyr 



<210> 3 

<211> 1589 

<212> DNA 

<213> Danio rerio 

<220> 

<221> CDS 

<222> (86) . . (1231) 

<220> 

<221> primer_bind 
<222> (6) . . (26) 
<223> MCK2 

<220> 

<221> primer_bind 
<222> (20) . . (38) 
<223> MCK1 

<220> 

<221> polyA^signal 
<222> (1534) . . (1539) 

<400> 3 

cctatttcgg cttggtgaac aggatctgat cccaaggact gttaccactt ttgttgtctt 60 

ttgtgcagtg ttagaaaccg caatc atg cct ttc gga aac acc cac aac aac 112 

Met Pro Phe Gly Asn Thr His Asn Asn 

1 5 

ttc aag ctg aac tac tea gtt gat gag gag tat cca gac ctt age aag 160 

Phe Lys Leu Asn Tyr Ser Val Asp Glu Glu Tyr Pro Asp Leu Ser Lys 

10 15 20 25 

cac aac aac cac atg gec aag gtg ctg act aag gaa atg tat ggc aag 208 
His Asn Asn His Met Ala Lys Val Leu Thr Lys Glu Met Tyr Gly Lys 
30 35 40 

ctt agg gac aag cag acc cca cct gga ttc act gtg gat gat gtc ate 256 
Leu Arg Asp Lys Gin Thr Pro Pro Gly Phe Thr Val Asp Asp Val He 
45 50 55 

cag act ggt gtt gac aat cca ggc cac ccc ttc ate atg acc gtc ggc 304 
Gin Thr Gly Val Asp Asn Pro Gly His Pro Phe He Met Thr Val Gly 
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60 65 70 

tgt gtt get ggt gat gag gag tec tac gat gtt ttc aag gac ctg ttc 352 

Cys Val Ala Gly Asp Glu Glu Ser Tyr Asp Val Phe Lys Asp Leu Phe 

75 80 85 

gac ccc gtc att tec gac cgt cac ggt gga tac aag gca act gac aag 400 

Asp Pro Val 'He Ser Asp Arg His Gly Gly Tyr Lys Ala Thr Asp Lys 

90 95 100 105 

cac aag acc gac etc aac ttt gag aac ctg aag ggt ggt gat gac ctg 448 

His Lys Thr Asp Leu Asn Phe Glu Asn Leu Lys Gly Gly Asp Asp Leu 

HO 115 120 

gac ccc aac tac ttc ctg age age cgt gtg cgt acc gga cgc age ate ■ 496 

Asp Pro Asn Tyr Phe Leu Ser Ser Arg Val Arg Thr Gly Arg Ser lie 

125 130 135 



aag gga tac ccc ctg ccc ccc cac aac age cgt gga gag cgc aga get 
Lys Gly Tyr Pro Leu Pro Pro His Asn Ser Arg Gly Glu Arg Arg Ala 
140 145 150 



185 



cag ctg ate get gac cac ttc etc ttt gac aaa ccc gtc tec ccc ctg 
Gin Leu lie Ala Asp His Phe Leu Phe Asp Lys Pro Val Ser Pro Leu 
190 195 200 



aag cac aac cat ggg ttc atg tgg aac gag cat ctt ggt ttc gtc ctg 
Lys" His Asn His Gly Phe Met Trp Asn Glu His Leu Gly Phe Val Leu 
2^0 275 280 



544 



gtg gag aag ctg tct gtt gaa get ctg agt age ttg gat gga gag ttc 592 

Val Glu Lys Leu Ser Val Glu Ala Leu Ser Ser Leu Asp Gly Glu Phe 

155 160 165 

aag ggc aag tac tac ccc ctg aag tec atg act gat gac gag cag gag 640 

Lys Gly Lys Tyr Tyr Pro Leu Lys Ser Met Thr Asp Asp Glu Gin Glu 

170 , 175 180 



688 



ctg ctg get get ggt atg gee cgt gac tgg ccc gat gee aga ggc att 736 

Leu Leu Ala Ala Gly Met Ala Arg Asp Trp Pro Asp Ala Arg Gly lie 

205 ^210 215 

tgg cac aat gag aac aaa gee ttc ctg gtc tgg gtg aaa cag gag gat 784 

Trp His Asn Glu Asn Lys Ala Phe Leu Val Trp Val Lys Gin Glu Asp 
22 0 225 230 

cac ctg cgt gtc att tec atg cag aag ggt ggc aac atg aag gaa gtg 832 

His Leu Arg Val He Ser Met Gin Lys Gly Gly Asn Met Lys Glu Val 
235 240 245 

ttc aag cgc ttc tgc gtt ggt ctt cag agg att gag gaa att ttc aag 880 

Phe Lys Arg Phe Cys Val Gly Leu Gin Arg He Glu Glu He Phe Lys 
250 255 260 265 



928 



acc tgc ccc tec aac ctg ggc aca ggc ctg cgc ggt gga gtc cac gtc 976 
Thr Cys Pro Ser Asn Leu Gly Thr Gly Leu Arg Gly Gly Val His Val 
285 290 295 
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aag ctg ccc aag etc age aca cat gec aag ttt gag gag ate ctg acc 1024 

Lys Leu Pro Lys Leu Ser Thr His Ala Lys Phe Glu Glu He Leu Thr 

300 305 310 

aga ctg cgc ctg cag aag cgt ggc aca ggg ggt gtg gac acc get tec 1072 

Arg Leu Arg Leu Gin Lys Arg Gly Thr Gly Gly Val Asp Thr Ala Ser 
315 320 325 

gtt ggt gga gtg ttt gac att tec aac get gac cgt ate ggc tct tea 1120 

Val Gly Gly Val Phe Asp He Ser Asn Ala Asp Arg lie Gly Ser Ser 

330 335 340 345 

gag gtt gag cag gtg cag tgt gtg gtt gat ggt gtc aag ctg atg gtg 1168 

Glu Val Glu Gin Val Gin Cys Val Val Asp Gly Val Lys Leu Met Val 

350 355 360 

gag atg gag aag aag ctg gga gaa ggc cag tec ate gac age atg ate 1216 

Glu Met Glu Lys Lys Leu Gly Glu Gly Gin Ser He Asp Ser Met lie 

365 370 375 

cct gee cag aag taa agegggagge ccttccattt ttttcttcgt ctttgtctgt 1271 
Pro Ala Gin Lys 
380 



ttttttacag tccaacagca aegsagagga aaactgctgc tcaaaaagac agtctcacct 1331 

ttgcacctgt cttctttcct ttttttccct tcttctctaa tttccatgtc atttcgecat 1391 

ctttttttcc actttgtttc ctattaagtc ggtaacatct tgggatcaga tacccggsgc 1451 

aggagtgagt gcttgttgct gaggcttcac ctcaatttca gccttggttg taaaaagtga 1511 

atcaatcaaa gttgtatttc aaaataaaaa tccccaataa aaaaaaaaaa aaaaaaaaaa 1571 

aaaaaaaaaa aaaaaaaa 1589 



<210> 4 

<211> 381 

<212> PRT 

<213> Danio rerio 

<400> 4 

Met Pro Phe Gly Asn Thr His Asn Asn Phe Lys Leu Asn Tyr Ser Val 
15 10 15 

Asp Glu Glu Tyr Pro Asp Leu Ser Lys His Asn Asn His Met Ala Lys 
20 25 ■ 30 

Val Leu Thr Lys Glu Met Tyr Gly Lys Leu Arg Asp Lys Gin Thr Pro 
35 40 45 

Pro Gly Phe Thr Val Asp Asp Val He Gin Thr Gly Val Asp Asn Pro 
50 55 60 

Gly His Pro Phe He Met Thr Val Gly Cys Val Ala Gly Asp Glu Glu 
65 70 75 80 

Ser Tyr Asp Val Phe Lys Asp Leu Phe Asp Pro Val He Ser Asp Arg 
85 90 95 



8 




His Gly Gly Tyr Lys Ala Thr Asp Lys His Lys Thr Asp Leu Asn Phe 
100 105 110 

Glu Asn Leu Lys Gly Gly Asp Asp Leu Asp Pro Asn Tyr Phe Leu Ser 

115 120 125 

Ser Arg Val Arg Thr Gly Arg Ser He Lys Gly Tyr Pro Leu Pro Pro 
130 135 140 

His Asn Ser Arg Gly Glu Arg Arg Ala Val Glu Lys Leu Ser Val Glu 
145 150 155 160 

Ala Leu Ser Ser Leu Asp Gly Glu Phe Lys Gly Lys Tyr Tyr Pro Leu 
165 170 175 

Lys Ser Met Thr Asp Asp Glu Gin Glu Gin Leu He Ala Asp His Phe 
180 185 190 

Leu Phe Asp Lys Pro Val Ser Pro Leu Leu Leu Ala Ala Gly Met Ala 
195 200 205 

Arg Asp Trp Pro Asp Ala Arg Gly He Trp His Asn Glu Asn Lys Ala 
210 215 220 

Phe Leu Val Trp Val Lys Gin Glu Asp His Leu Arg Val He Ser Met 
225 230 235 240 

Gin Lys Gly Gly Asn Met Lys Glu Val Phe Lys Arg Phe Cys Val Gly 
24 5 250 255 

Leu Gin Arg He Glu Glu He Phe Lys Lys His Asn His Gly Phe Met 
260 265 270 

Trp Asn Glu His Leu Gly Phe Val Leu Thr Cys Pro Ser Asn Leu Gly 
275 280 285 

Thr Gly Leu Arg Gly Gly Val His Val Lys Leu Pro Lys Leu Ser Thr 
290 295 300 

His Ala Lys Phe Glu Glu He Leu Thr Arg Leu Arg Leu Gin Lys Arg 
305 310 315 320 

Gly Thr Gly Gly Val Asp Thr Ala Ser Val Gly Gly Val Phe Asp He 
325 330 335 

Ser Asn Ala Asp Arg He Gly Ser Ser Glu Val Glu Gin Val Gin Cys 
340 345 35,0 

Val Val Asp Gly Val Lys Leu Met Val Glu Met Glu Lys Lys Leu Gly 
355 360 365 

Glu Gly Gin Ser He Asp Ser Met He Pro Ala Gin Lys 
3^0 375 380 



<210> 5 



9 




<211> 1104 

<212> DNA 

<213> Danio rerio 

<220> 
<221> CDS 

<222> (75) . . (1034) 
<220> 

<221> primer__bind 
<222> (45) . . (64 ) 
<223> ARP2 

<220> 

<221> primer_bind 
<222> (87) . . (112) 
<223> ARP1 

<220> 

<221> polyA signal 
<222> (1069T- - (1074) 

<400> 5 

cgcgtcccta ccgtgagatt ttacaacctt gtctttaaac cggctgttca ccgatccttg 60 

gaagcactgc aaag atg ccc agg gaa gac agg gcc acg tgg aag tec aac 110 
Met Pro Arg Glu Asp Arg Ala Thr Trp Lys Ser Asn 
15 10 

tat ttt ctg aaa ate ate caa ctg ctg gat gac ttc ccc aag tgt ttc 158 
Tyr Phe Leu Lys lie lie Gin Leu Leu Asp Asp Phe Pro Lys Cys Phe 
15 20 25 

ate gtg ggc gca gac aat gtc ggc tec aag cag atg cag acc ate cgt 206 
lie Val Gly Ala Asp Asn Val Gly Ser Lys Gin Met Gin Thr lie Arg 
30 35 40 

ctg tec ctg egg ggc aag gcc gtc gtg etc atg ggg aaa aac acc atg 254 
Leu Ser Leu Arg Gly Lys Ala Val Val Leu Met Gly Lys Asn Thr Met 
4 ^ 50 55 60 

atg agg aag gcc att cgt ggc cac ctg gaa aac aac cca get ctg gag 302 
Met Arg Lys Ala lie Arg Gly His Leu Glu Asn Asn Pro Ala Leu Glu 
65 70 75 

agg ctg ctt ccc cac ate cgc ggg aac gtg ggc ttc gtc ttc acc aag 350 
Arg Leu Leu Pro His lie Arg Gly Asn Val Gly Phe Val Phe Thr Lys 
80 85 90 

gag gat ctg act gag gtc cga gac ctg ctg ctg gca aac aaa gtg ccc 398 
Glu Asp Leu Thr Glu Val Arg Asp Leu Leu Leu Ala Asn Lys Val Pro 
95 100 105 

get get gcc cgt get ggt gcc ate gcc ccc tgt gag gtg act gtg ccg 446 
Ala Ala Ala Arg Ala Gly Ala lie Ala Pro Cys Glu Val Thr Val Pro 
HO 115 120 
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* 



gcc cag aac acc ggg etc ggt cct gag aag acc tct ttc ttc cag get 
Ala Gin Asn Thr Gly Leu Gly Pro Glu Lys Thr Ser Phe Phe Gin Ala 
125 130 135 140 



494 



ttg gga ate acc acc aag ate tec aga gga acc att gaa ate ttg agt 
Leu Gly He Thr Thr Lys He Ser Arg Gly Thr He Glu He Leu Ser 
145 150 155 



542 



gac gtt cag ctt ate aaa cct gga gac aag gtg ggc gcc age gag gcc 
Asp Val Gin Leu lie Lys Pro Gly Asp Lys Val Gly Ala Ser Glu Ala 
160 165 170 



590 



acg ctg ctg aac atg ctg aac atg ctg aac ate teg ccc ttc tec tac 
Thr Leu Leu Asn Met Leu Asn Met Leu Asn He Ser Pro Phe Ser Tyr 
175 , . 180 185 



638 



ggg ctg ate ate cag cag gtg tat gat aac ggc agt gtc tac age ccc 
Gly Leu He He Gin Gin Val Tyr Asp Asn Gly Ser Val Tyr Ser Pro 
190 195 200 



686 



gag gtg ctg gac ate act gag gac gcc ctg cac aag agg ttc ctg aag 
Glu Val Leu Asp Ile^Thr Glu Asp Ala Leu His Lys Arg Phe Leu Lys 
205 210 215 220 



734 



ggt gtg agg aac ate gcc agt gtg tgt ctg cag ate ggc tac cca act 
Gly Val Arg Asn He Ala Ser Val Cys Leu Gin He Gly Tyr Pro Thr 
225 230 235 



782 



ctt get tec ate cct cac act ate ate aat gga tac aag agg gtc ctg 
Leu Ala Ser He Pro His Thr He He Asn Gly Tyr Lys Arg Val Leu 
240 245 250 



830 



get gtc act gtc gaa aca gac tac aca ttc ccc ttg get gag' aag gtg 
Ala Val Thr Val Glu Thr Asp Tyr Thr Phe Pro Leu Ala Glu Lys Val 
255 260 265 



878 



aag gcc tac ctg get gat ccc acc get ttc get gtt gca gcc cct gtt 
Lys Ala Tyr Leu Ala Asp Pro Thr Ala Phe Ala Val Ala Ala Pro Val 
270 275 280 



926 



gcg gca get aca gag cag aaa tec get get cct gcg get aaa gag gag 
Ala Ala Ala Thr Glu Gin Lys Ser Ala Ala Pro Ala Ala Lys Glu Glu 
285 * 290 295 300 



974 



gca ccc aag gag gat tct gag gag tct gat gaa gac atg ggc ttc ggc 
Ala Pro Lys Glu Asp Ser Glu Glu Ser Asp Glu Asp Met Gly Phe Gly 
305 310 315 



1022 



ctg ttt gat taa accagacacc gaatatccat gtctgtttaa catcaataaa 
Leu Phe Asp 



1074 



acatctggaa acaaaaaaaa aaaaaaaaaa 



1104 



<210> 6 
<211> 319 
<212> PRT 



1 1 



<213> Danio rerio 



<400> 6 

Met Pro Arg Glu Asp Arg Ala Thr Trp Lys Ser Asn Tyr Phe Leu Lys 
15 10 15 

lie lie Gin Leu Leu Asp Asp Phe Pro Lys Cys Phe lie Val Gly Ala 
20 25 30 

Asp Asn Val Gly Ser Lys Gin Met Gin Thr lie Arg Leu Ser Leu Arg 
35 40 45 

Gly Lys Ala Val Val Leu Met Gly Lys Asn Thr Met Met Arg Lys Ala 
50 55 60 

lie Arg Gly His Leu Glu Asn Asn Pro Ala Leu Glu Arg Leu Leu Pro 
65 70 75 80 

His lie Arg Gly Asn Val Gly Phe Val Phe Thr Lys Glu Asp Leu Thr 
85 90 95 

Glu Val Arg Asp Leu Leu Leu Ala Asn Lys Val Pro Ala Ala Ala Arg 
100 105 110 

Ala Gly Ala lie Ala Pro Cys Glu Val Thr Val Pro Ala Gin Asn Thr 
115 120 125 

Gly Leu Gly Pro Glu Lys Thr Ser Phe Phe Gin Ala Leu Gly lie Thr 
130 135 140 

Thr Lys lie Ser Arg Gly Thr lie Glu lie Leu Ser Asp Val Gin Leu 
145 150 155 160 

lie Lys Pro Gly Asp Lys Val Gly Ala Ser Glu Ala Thr Leu Leu Asn 
165 170 175 

Met Leu Asn Met Leu Asn lie Ser Pro Phe Ser Tyr Gly Leu lie lie 
180 185 190 

Gin Gin Val Tyr Asp Asn Gly Ser Val Tyr Ser Pro Glu Val Leu Asp 
195 200 205 

lie Thr Glu Asp Ala Leu His Lys Arg Phe Leu Lys Gly Val Arg Asn 
210 215 220 

lie Ala Ser Val Cys Leu Gin lie Gly Tyr Pro Thr Leu Ala Ser lie 
225 230 235 240 

Pro His Thr lie lie Asn Gly Tyr Lys Arg Val Leu Ala Val Thr Val 
245 250 255 

Glu Thr Asp Tyr Thr Phe Pro Leu Ala Glu Lys Val Lys Ala Tyr Leu 
260 265 270 

•Ala Asp Pro Thr Ala Phe Ala Val Ala Ala Pro Val Ala Ala Ala Thr 
275 280 285 
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Glu Gin Lys Ser Ala Ala Pro Ala Ala Lys Glu Glu Ala Pro Lys Glu 

290 295 300 

Asp Ser Glu Glu Ser Asp Glu Asp Met Gly Phe Gly Leu Phe Asp 

305 310 315 




<210> 7 

<211> 2241 

<212> DNA 

<213> Danio rerio 

<220> 

<221> TATA__signal 
<222> (2103) . . (2108) 

<220> 

<221> primer^bind 
<222> (2221) . . (2241) 
<223> CK2 

<220> 

ja <221> misc_feature 

I <222> (2142) . . (2235) 1 

| <223> Identical to the 5' CK cDNA 

| <400> 7 

j; ccttcccttc tacttttgac gtccttttaa gagcttgtgc atgaaagcag atttggagct 60 
I gattactcat ctcaaacacc catacaaagg gatgattgcc gtaccatgat ctcacacctt 120 
: ! tcacacctgg tttatactat gatagttgta gacgattgcg taatgctatt aaatgcccat 180 
■ cagtgctggc tgtgacaccc aactgctgcc atttcgtgtt gacttgcacg agaaatgaga 240 
aattgtctga ctatgcaggg tgtctatgcg tgggaacatt tatcagtggt cattaaatac 300 
tatagtttac agttagacca aagtgtgctg tatttttgtg ttagcttagc tgcagttttt 360 
gtgtgtgaag taacaaatga caaatactca aactattgta attaagtagt ttttctcaga 420 
aattgtaatt tactaagtag tttaaaaatg tgtactttta ctttcccttg agtacatttt 480 
tagtgcagtg ttggtacttt tatttcactt ccttccttca acctgcagtc actactttat 540 
ttattcttgt ctatgtggat tagacaaatc agtcctgtga ttcctgtcca atcaaattgc 600 
acatagaagg taaatcacat cataatgaac taccttaaga catgggccat ttataattgc 660 
f agcaaactgt ttgccagcat taaaagaaga tgtcaaaaat atttacacgc attaacccag 720 
agactgctta gatgcatgtc actgatgaga agatgatgga tgtttactgt atgatgaccg 780 
aaataacttt aaacgcacac aagacggcac aagacgtcaa catggcgtta ggttgacgtt 840 
gtaccccaac gcagtgggga cgttgcattt tgtttagaaa tgaaaattag gttgacgtca 900 
gaactcaacg tcaggtcgat gtcaatgttc aacatccaat ctaaaatcat atatcaatgt 960 
ctaatgatgt tacagcttga tgttatgcgg atgttacccc tatgacgtct atcagacgtt 1020 
ggattatggt tgccatacct gatgaataaa tgtcattatt tgacgttggt ttaagatgtt 1080 
ggttcgacat tggattttgg tcgctttcca acacaaccta aatccaccaa atattaactt 1140 
cctatgacat cgttattgga cgtcaaaata acaatatcct tagatgctgg ctagactttg 1200 
aatttaggtc accacaacct atatttaacc taatattaac atcttatgat gttgtgtgcc 1260 
tgctgggcaa taactaaatg cactacagaa tgttacgttt acacacatgt aaattacatg 1320 
taaatgcatc agcttttcac agcataatac tcactactta ctactcttga gtacttttaa 1380 
aaaagctact tttcactcat actttgagta atatttacaa ctgatacttt tactcgcact 1440 
acatttttag gcatgtattg atatttttac tatgattttt cagtactctt tccactactg 1500 
cagccctccc catacataat cgtatgttta cacatatggt ggagtttaga gccataatct 1560 
acattagctt tgttagccgc tagcattact gtgcagaatt gtgtgtgtgc acattttcca 1620 
atatcaatac agaaggaaac tgtgttccct gttcccttgt aaatctcaac aatgcaactg 1680 
ttcagctcag ggggaaaaat gccctgccag atccaaacgg ctggcaaaag tgaatggaaa 1740 
aaagcctttc attaatgtga aagttgctgc gcgccccacc cagataaaaa gagcagaggt 1800 



13 



taacatgctc tctacggctg tccagccaac cagatactga ggcagaaaca cacccgctgg 1860 

cagatggtga gagctacact gtcttttcca gagtttctac tggaatgcct gtcctcaagt 1920 

ctcaagcctc tccttgcatt ctctcattcc acctggggca aagccccagg ctgggtgtga 1980 

caacatttat cttaccactt tctctctgta cctgtctaac aggtagggtg tgtgtgagag 2040 

tgcgtatgtg tgcaagtgcg tgtgtgtgtg agagcagtca gctccaccct ctcaagagtg 2100 

tgtataaaat tggtcagcca gctgctgaga gacacgcaga gggactttga ctctcctttg 2160 

tgagcaacct cctccactca ctcctctctc agagagcact ctcgtacctc cttctcagca 2220 

actcaaagac acaggatccg g 2241 




<210> 8 

<211> 1456 

<212> DNA 

<213> Danio rerio 

<220> 

<221> TATA_signal 
<222> (1389) . . (1394 ) 

<220> 

<221> primer_bind 

<222> (1433)7. (1456) 

<223> MCK2 

<220> 

<221> misc^feature 
<222> (1428) . . (1453) 

<223> Identical to the 5' MCK cDNA 
<400> 8 

gaattgcaaa gtcagagtaa taaaatgaaa ccaaaaaaca tttttaaata tacttgtctc 60 

tgtggcttaa tcttggctga tgtgtgtgtg tgtgtgtgtg tacttgacag ctgctagtga 120 

gcatgtgcac catgacaggc ctgttattca cacttggtgc catgttggag actgttcggc 180 

cagctatagt tttcttcaca gagtcctggg tcacctaatg tcacaaggaa gaaacatgtt 240 

acatgttaaa atgtgacatt caaattgtag tgcattactt aacgaaacgc attacacaag 300 

ttacagctta aaagattgct agacagaaaa accagggagg ggttttccca taatatccag 360 

tgagactcta ggagcgggaa cactaacagg cctccctgag tgagaacatt gcatgtgcgc 420 

gtgacagaaa accagagatg gaaatacctt cttttgaatt gcataattgc ttaaaagaag 480 

acacaacagg gatagttcac ccaaaaaaca gaccattctt tttttctgtt gaacaaaaat 540 

taagatattt tgaagaatgc ttaccgaata acttccatat ttggaaacta attacagtga 600 

aagtcaatgg gtcttccagc attttttcaa tataccttac tttgagttca aaagaaaaac 660 

acatctcaaa taggtttgag gttgaataaa catttttcat tttggggtgg actatcccta 720 

attatttgac acttaagatt tatagtaaat cattttatag actttctccc cttattaaac 780 

atggttgaat ttatcttcat gtttatgtct gggttgtgct tttttgaaaa gatttccctg 840 

tcaaatgttt ttgtgtatgg ttggcgcaca atagactgaa ctggcctatc acacagactt 900 

tcataacaac tccagttgat gccctttcac cctcagtgta taaatatggc gtctgacatg 960 

agcagattaa acacgacact gcaacaactt tacctgtaaa aatacaaatt gagtttgcac 1020 

ccagaatcat gtggtgaacg aagcctacca agagattttt gaaagccatc ggcctgacac 1080 

gcgcacttct gatatctgtg gtatgtttgg caaaagtgct gctcagcctt tttagcatgg 1140 

cagatcctcc acatcccatc acccctcctt caacctattc cctcctggaa agctatgtat 1200 

ggggcgggaa gtgtaaatgg atatgggaag gaaggggggc accacccaca gctgccacct 1260 

catctaggat gcctggggcc taaattgaag cctttcttac actaaacagg gcataagaga 1320 

ccagcgccag ccaatcataa ttcagtgagc tctaaaatgg gccagccaat ggctgcaggg 1380 

gctagaggta tatatatcca aatcaaactc ttcttgcttg ggtgacccct atttcggctt 1440 

ggtgaacagg atccgg 1456 
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<210> 9 

<211> 2205 

<212> DNA 

<213> Danio rerio 



<220> 

<221> primer bind 
<222> (2179)7. (2205) 
<223> ARP2 



<220> 

<221> misc_f eature 
<222> (2153) . . (2199) 

<223> Identical to the 5' ARP cDNA 



<220> 

<221> intron 

<222> (792) . . (2152) 

<220> 

<221> misc_feature 
<222> (775) . . (791) ' 

<223> Identical to the 5 ? ARP cDNA 



<400> 9 

atctgtatta 

attttaattt 

agaaataata 

gaaagagaaa 

tga ttgtcaa 

aga taataaa 

aatacattca 

gtgcagattt 

atttacattc 

tcaaccctca 

tatattgacc 

cagacacgtg 

atgatcgact 

cctaccgtga 

taacatcata 

tatgcgttta 

caggcctcgt 

tccgattctg 

ttacatgcta 

tt ttaaaacc 

tcaatcacat 

tttatattag 

gtaaagttaa 

tcaaatctta 

accataatac 

ttcaaaatat 

tgcataaatc 

gcagtggtca 

taattaaaca 

gcaggtgtgt 

gtgacccctg 

ttatggacag 



agaaacactt 
gtgttgtata 
caaatttgtt 
gagt t ttatt 
gatgataaaa 
gtgacttttt 
ttaaaataaa 
ttggctgttg 
tttgctatta 
cagtaatgta 
gcagctgtat 
cattgaccaa 
gatactaata 
ggtaaggctg 
agaggtcacg 
aagcttgtgt 
gcgccatgtg 
atattaatca 
gtactt gtgc 
agttactctc 
ataattgtgt 
tatgt acaat 
tcttaaaggg 
tgagtttctt 
aaatacagga 
ctacacaagt 
agttttcatt 
ccaagcttgt 
cacct gaaca 
tgatgcaaga 
cctcaagcca 
gctgttgcag 



aaaatatata 
attttacatt 
t acagtattc 
ataagatgtt 
gtcagtttag 
tgggcggacc 
ggtattgcga 
ttagaaggga 
aattatccat 
aaaatatt at 
cctttctaag 
tcagcgcaca 
ttgtgccgct 
acgccgctct 
agaaggtcta 
aatgattttt 
tgacgcgacg 
tatttatgcg 
tagtcggtcg 
attttagtga 
ttatgtttta 
ttggcataaa 
gtaaaggct c 
aatgaacatg 
aaaatatact 
gtttaatgga 
tgggtgagct 
tcctgaaggg 
agctaatcaa 
tagagctaaa 
tcacaaatgc 
tgcttgttcg 



tgcgttacga 
ttgtaagt at 
ttagttattg 
caattt aaaa 
acaaaaaaat 
aaatttccct 
tgaatttaga 
tacatctgcg 
tatttgtatt 
ttattgtttt 
tgcgactgta 
gatacgcatt 
t cct ttcgcg 
tgtggcggtt 
cgtgtgttta 
acagtaaaag 
tt ttaat age 
taaaatgtgt 
atccacattg 
aatattctta 
tttgagtcat 
ctgeett egg 
acccaaaaga 
gtatgttttg 
atagaagt eg 
aggaactcaa 
gtctctaaac 
ccagtgtcct 
ggtcttacta 
ccctgcaggg 
attatggt at 
tcgttcccac 



attaaaaaca 
tatttttata 
caataaacga 
aatggcagaa 
aagatgaaaa 
attaatggtc 
tgcacagtga 
gecgaaagtt 
ttattacccc 
at agegtcac 
caaatacgea 
tt ccgcgcga 
gcctctttct 
tcttaaaatg 
atatcagegg 
ttagcactag 
atcttatttg 
gatgggtctg 
agatgttgcg 
agecactaag 
cataccaggt 
ttttgattga 
caattcaccg 
gagaaaact g 
atggttacag 
gtgatttgaa 
atttgattta 
acagatttta 
ggtatgtttg 
acaatggccc 
taagaaatgt 
tgcacaaatg 



aaacacgatc 
aaaaatatat 
attttatata 
aatagaaaaa 
acatcaaaat 
aattcattaa 
ttttggttct 
aacgggaact 
aacegtaaac 
agaatgatgc 
ctgaccgtga 
ttctgattgg 
ttcacgcgtc 
tgttaataaa 
eggttattat 
cctgttagca 
att ttgatga 
ctagtggaca 
ct atttgeca 
ttaaaatttg 
aatagtttta 
catctacttt 
tcaagtgttt 
gaaaccaact 
gttttctgea 
aagttaaggg 
gacacctcag 
gctccaaccc 
aaacatccag 
aacaggattg 
gcaggttcag 
aacatgattc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 
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cttctatccc tgtctgtctg catctcatga cttgcaggga cgctggtctc agacacgttt 1980 

atagcagtaa atcaaataca atagtgctct gattatcttt aaatatttga aagcttataa 2040 

taggcaacca aattacctgg aaacagttta caaacagtaa ttcatatttt gtcatttaat 2100 

aagatgcaca caaggcaggt gtaaaagtat tgcttgtgtt tgtaatcctc agattttaca 2160 

accttgtctt taaaccggct gttcaccgat ccttggaagg gatcc 2205 



<210> 10 

<211> 24 

<212> DNA 

<213> Artificial Sequence 

<220> • 

<223> 'Description of Artificial Sequence: Cytokeratin 

. gene specific primer 

" <4'00> 10 

cgctggagta agagatagac ctgg 24 




<210> 11 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cytokeratin 
gene specific primer 

<220> 

<221> misc_f eature 
<222> (1) . . (6) 

<223> Introduced for restriction site 
<220> 

<221> misc_f eature 
<222> (3) . . (8) 
<223> BamHI site 

<400> 11 

ccggatcctg tgtctttgag ttgctg 26 



<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Muscle 
creatine kinase gene specific primer 

<220> 

<221> misc_feature 
<222> (3) . . (8) 
<223> BamHI site 
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<400> 12 

ccggatcctt gggatcagat cctg 



24 



<210> 13 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Muscle 
creatine kinase gene specific primer 

<220> 

<221> misc_f eature 
<222> (1) .7(3) 

<223> Introduced for restriction site 
<220> 

<221> misc feature 
<222> (3) .7(8) 
<223> BamHI site 

<400> 13 

ccggatcctg ttcaccaagc cgaa 24 



<210> 14 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Acidic 
ribosomal protein PO" gene specific primer 



<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Acidic 
ribosomal protein PO gene specific primer 

<220> 

<221> misc_feature 
<222> (1) . . (7) 

<223> Introduced for restriction site 
<220> 

<221> misc feature 



<400> 14 

tagttggact tccacgtgcc ctgtc 



25 



17 



<222>' (1) . . (6) 
<223> BamHI site 

<400> 15 

ggatcccttc caaggatcgg tgaaca t 26 



<210> 16 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Oligonucleotide for linker used in linker-mediated 
PGR 

<400> 16 

gttcatcttt acaagctagc gctgaacaat gctgtggaca agcttgaatt c 51 




<210> 17 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Oligonucleotide for linker used in linker-mediated 
PCR 

<220> 

<221> misc_f eature 

<222> (10) . . (10) 

<223> n is a dideoxycyt idine 

<400> 17 

gaattcaagn 10 



<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker 
specific primer 

<400> 18 

gttcatcttt acaagctagc g 21 



<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: linker 
specific primer 

<400> 19 

tcctgaacaa tgctgtggac 



<210> 20 

<211> 1392 

<212> DNA 

<213> Danio rerio 

<220> 

<221> CDS 

<:222> (42) . . (551) 

<220> 

<221> priraer_bind 

„ <222> (6) . . (28) 

; ' ! <2 2 3> M2 

<220> 

<221> primer_bind 
<222> (23) . . (45) 
<223> Ml 

<220> 

<221> polyA_signal 
<222> (797) . . (802) 

<220> 

<221> polyA_signal 
<222> (1351) . . (1357) 

<400> 20 

ctcttcttga tcttcttaga cttcacacat accgtctcga c atg gca ccc aag aag 56 

Met Ala Pro Lys Lys 
1 5 



gcc aag agg agg gca gca gga gga gag ggt tec tec aac gtc ttc tec 104 

Ala Lys Arg Arg Ala Ala Gly Gly Glu Gly Ser Ser Asn Val Phe Ser 

10 15 20 

atg ttt gag cag age cag att cag gag tac aaa gag get ttc aca ate 152 

Met Phe Glu Gin Ser Gin lie Gin Glu Tyr Lys Glu Ala Phe Thr lie 

25 30 35 

att gac cag aac aga gac ggt ate ate age aaa gac gac ctt agg gac 200 

lie Asp Gin Asn Arg Asp Gly lie lie Ser Lys Asp Asp Leu Arg Asp 

40 45 50 

gtg ttg gcc tea atg ggc cag ctg aat gtg aag aat gag gag ctg gag 248 

Val Leu Ala Ser Met Gly Gin Leu Asn Val Lys Asn Glu Glu Leu Glu 
55 60 65 



19 



gcc atg ate aag gaa gec age ggc cca ate aac ttc acc gtt ttc etc 296 

Ala Met He Lys Glu Ala Ser Gly Pro He Asn Phe Thr Val Phe Leu 

70 75 80 85 

acc atg ttc gga gag aag ttg aag ggt get gac ccc gaa gac gtc ate 344 

Thr Met Phe Gly Glu Lys Leu Lys Gly Ala Asp Pro Glu Asp Val He 

90 95 100 

gtg tct gcc ttc aag gtg ctg gac cct gag ggc act gga tec ate aag 392 

Val Ser Ala Phe Lys Val Leu Asp Pro Glu Gly Thr Gly Ser He Lys 

105 110 115 

aag gaa ttc ctt gag gag ctt ttg acc act cag tgc gac agg ttc acc 440 

Lys Glu Phe Leu Glu Glu Leu Leu Thr Thr Gin Cys Asp Arg Phe Thr 

120 125 130 

gca gag gag atg aag aat ctg tgg gcc gcc ttc ccc cca gat gtg get 488 

*Ala Glu Glu Met Lys Asn Leu Trp Ala Ala Phe Pro Pro Asp Val Ala 

135 140 145 



ggc aat gtt gac tac aag aac ate tgc tac gtc ate aca cac gga gag 
Gly Asn Val Asp Tyr Lys Asn He Cys Tyr Val He Thr His Gly Glu 
150 155 160 165 



536 



gag aag gag gag taa acaaccttgg aatcaagaaa acgaagagaa gaacatgeat 
Glu Lys Glu Glu 



591 



cctcacagct 
ccctttcttg 
agacttgt ct 
cgagtgggga 
ctctctttct 
cttgcatgta 
tagtttgggg 
aagaagcacg 
ctcctctggt 
tagcctccta 
aggaagagag 
cctctt ctct 
tctgggcctg 
aataaaaaaa 



taatct ccag 
cttt ctacca 
cgctgggact 
catgggattg 
ctctgtttct 
agtegctget 
ctgaaacatc 
atggagtgat 
ggtt gactaa 
aaagcgatgc 
gagagcaggg 
atttctctgt 
etttgetctg 
aaaaaaaaaa 



tctgttgtct 
tcgttgttac 
gaattgggag 
t t ttcaataa 
cactcattac 
t cttct gctg 
tctctagatc 
ctcactctat 
aaagagaaag 
cgagctcatc 
actgaaagaa 
ccatctt ttc 
ccaaacctct 
a 



ggcett ctct 
tccaagcact 
ggtggagagg 
aatgaacatc 
ccacaacccc 
cagtcttagg 
aatgtggaag 
aatagaggaa 
agaaaatgag 
acagagggag 
aacataacct 
ttttttcttt 
cctgtaacca 



aactt ttgtt 
tacactctcc 
aacacgacca 
atttctgtat 
t ctcttt cat 
agttgaaacg 
agtgctcact 
ccagt catca 
ggttttgtgc 
tgagagggac 
cttcactccc 
tttctttttt 
ataaaaagac 



t ttccttcct 
atcttaccaa 
cagtgtctgt 
ctctcacatt 
ttcagtcaag 
aaggca tcta 
ctgaggggga 
ttctcatttc 
tgagtgagtt 
agaccatcct 
cctctcccct 
getttctgea 
acaaactgtg 



651 

711 

771 

831 

891 

951 

1011 

1071 

1131 

1191 

1251 

1311 

1371 

1392 



<210> 21 

<211> 169 

<212> PRT 

<213> Danio rerio 

<400> 21 

Met Ala Pro Lys Lys Ala Lys Arg Arg Ala Ala Gly Gly Glu Gly Ser 
15 10 15 

Ser Asn Val Phe Ser Met Phe Glu Gin Ser Gin He Gin Glu Tyr Lys 
20 25 30 

Glu Ala Phe Thr He He Asp Gin Asn Arg Asp Gly He He Ser Lys 
35 40 45 
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Asp Asp Leu Arg Asp Val Leu Ala Ser Met Gly Gin Leu Asn Val Lys 
50 55 60 

Asn Glu Glu Leu Glu Ala Met lie Lys Glu Ala Ser Gly Pro He Asn 
65 70 75 80 

Phe Thr Val Phe Leu Thr Met Phe Gly Glu Lys Leu Lys Gly Ala Asp 
85 90 95 

Pro Glu Asp Val He Val Ser Ala Phe Lys Val Leu Asp Pro Glu Gly 
100 105 110 

Thr Gly Ser lie Lys Lys Glu Phe Leu Glu Glu Leu Leu Thr Thr Gin 
115 120 125 

Cys Asp Arg Phe Thr Ala Glu Glu Met Lys Asn Leu Trp Ala Ala Phe 
130 135 140 

Pro Pro Asp Val Ala Gly Asn Val Asp Tyr Lys Asn lie Cys Tyr Val 
145 150 155 160 

He Thr His Gly Glu Glu Lys Glu Glu 
165 



<210> 22 

<211> 2054 

<212> DNA 

<213> Danio rerio 

<220> 

<221> TATA_signal 
<222> (1983) . . (1989) 

<220> 

<221> enhancer 
<222> (142) . . (148) 
<223> E-box, canntg 

<220> 

<221> enhancer 
<222> (452) . . (457) 
<223> E-box, canntg 

<220> - 

<221> enhancer 
<222> (1095) . . (1100) 
<223> E-box, canntg 

<220> 

<221> enhancer 
<222> (1278) . . (1283) 
<223> E-box, canntg 

<220> 

<221> enhancer 
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<222> (1362) . . (1367) 
<223> E-box, canntg 

<220> 

<221> enhancer 
<222> (1385) . . (1390) 
<223> E-box, canntg 

<220> 

<221> enhancer 

<222> (523) . . (532) 

<223> Potential MEF2 binding site, yta (w) 4tar 
<220> 

<221> enhancer 

<222> ( 606) . . (615) 

<223> Potential MEF2 binding site, yta(w)4tar 
<220> 

<221> enhancer 

<222> (697) . . (706) 

<223> Potential MEF2 binding site, yta(w)4tar 
<220> 

<221> enhancer 

<222> (1490) . . (1499) 

<223> Potential MEF2 binding site, yta(w)4tar 
<220> 

<221> enhancer 

<222> (1640) . . (1649) _ 

<223> Potential MEF2 binding site, yta (w) 4tar 



<220> 

<223> Transcription start site at residue 2012 
<220> 

<221> primer bind 
<222> (2032) . . (2054 ) 
<223> M2 

<220> 

<221> misc_dif f erence 
<222> (2027) . . (2054) 

<223> Identical to the 5' MLC2 f cDNA 
<400> 22 

tgcatgcctg gcaggtccac tctagaggac tactagtcat atgcgattct gaacaatgct 60 
gtggacagct tgaattcaat tcgccacaga ggaatgagcc accaactcat ccagtgtatt 120 
ttttatgcag cggatgccca tccagttgca accctacact gggaaacacc caaatctgtc 180 
ttttaaatta atatttttaa taggaagcta tgttatattt gtgcatatac attagattag 240 
tcagtacact ctcagaaata aatggtccgc aagctgtcac tgcggtggta ccttttcaaa 300 



}<220> l " s.* 

t<221> enhancer 

1 <222> (1956) . . (1965) 

<223> Potential MEF2* binding site, yta (w) 4tar 



22 




aggtacatat ttgtacttaa agggtccata ttgatacctc aaaagtatat attagtgcct 360 

aaaaatttta agagaaacac ttttgtactt tttaggtact aatatatacc cttgaggttt 420 

tattatggac ctttaaggta caaattttta tcatttggaa aggtaccacc ccagtgacag 480 

ctcacgccct ttatttctga gagtgaagct aaatctggag cttatttaac aaaataactt 540 

atgataacgg tccaaaaact actacaccca tatttatatg ttattgaaaa atattaaata 600 

caaatttata aacaggggaa aatcaagaga agcaaaacaa tggaaaaaaa ttagttcaaa 660 

atttgtaggt tgtaattttt ttgttgcaat atttagcttt atttaaattg ttttatcttt 720 

caattttgtg actaaaatat tattttaata aatatagctg tttaataaat ctgttttgtt 780 

taaatgcacc aaaatacatt gcctatattc actgagagat ggagaaaaat attcattttc 840 

aaaatgggct gtactcaatt actctaagca ctgtaattat ctgtttttct aggtactgta 900 

aaataatgct aatcttaaac aattttggaa caagcaggag caaaacaagc cgacagactc 960 

acaggatatc aaattagtct aaagaaagaa gggaatgcac aacaaactca aggggggcaa 1020 

aacagtgact gatgaattag acaagaaaaa gagagcaagg agcgctcaag attgtttagc 1080 

tattttggtc accacagctg ttccttatgc ctgccttccc aaaaaaaaac tgtcttaagc 1140 

ctcaaatttc tcttcatgag ggtccaacat caaccactca gagggctgta gtgtgctgac 1200 

catctaaaaa ctgggaaaaa ggggtaatta cgtgcttgtc cacagggcag cttcccacaa 1260 

atggcacctc acagtcactg aagtgaccgg gtgaggtcat aggtcgatcg gcagagagag 1320 

agagagagag agagagagag agtgctgaat atgggcatgg ccatgtggac gggtgtggtg 1380 

ggggcacttg aaccgaaatc ttacagcatc actatactag gaaaagcatt aaaacctatt 1440 

ttcgattagg gctgatttga aataagggtt aagacaccag aacgtcctct tatatatcag 1500 

ccagggcgtg aacaaatatg aacaacataa tcattggctc aaaaatctct ggattgaaat 1560 

ccatcaggat ctatcactgc aaccctcccc atccaacaag taatcctgca agataagcca 1620 

gtattgatct gctgctaatc taactttagg ggagggagtt gcgtcccctt tagactcagt 1680 

ggctacagct cattcatttc aaattgagtt atgtgattgt atgaagctca aacagtccct 1740 

tacgtcccca tgtccttatt agacaacgcg agacatgcag gccgctgcca tcagtatcag 1800 

attcatccca ttccaagact ccaatagcta tttctgagca ctgtaagatg atagtacatc 1860 

ccagccggtg tccctccatc actttccccc tacctcatag tttttcctct ttctctctcg 1920 

gtctgctatt tcccaaacct cacttaaggt tgggtctata attagcaagg ggccttcgtc 1980 

agtatataag cccctcaagt acaggacact acgcggcttc agacttctct tcttgatctt 2040 

cttagacttc acac 2054 

<210> 23 
<211> 23 
<212> DNA 

<213> Artificial Seguence 
<220> 

<223> Description of Artificial Sequence: MLC2F gene 
specific primer Ml 

<400> 23 

ccatgtcgag acggtatgtg tga 23 



<210> 24 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MLC2F gene 
specific primer M2 

<400> 24 

gtgtgaagtc taagaagatc aag 23 
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CHIMERIC GENE CONSTRUCTS FOR GENERATION OF FLUORESCENT 
TRANSGENIC ORNAMENTAL FISH 

FIELD OF THE INVENTION 

This invention relates to fish gene promoters and chimeric gene constructs with these 
5 promoters for generation of transgenic fish, particularly fluorescent transgenic ornamental 



BACKGROUND OF THE INVENTION 

Transgenic technology involves the transfer of a foreign gene into a host organism 
enabling the host to acquire a new and inheritable trait. The technique was first developed 

10 in mice by Gordon et al. (1980), They injected foreign DNA into fertilized eggs and found 
that some of the mice developed from the injected eggs retained the foreign DNA. 
Applying the same technique, Palmiter et al. (1982) have introduced a chimeric gene 
containing a rat growth hormone gene under a mouse heavy metal-inducible gene promoter 
and generated the first batch of genetically engineered supermice, which are almost twice 

15 as large as non-transgenic siblings. This work has opened a promising avenue in using the 
transgenic approach to provide to animals new and beneficial traits for livestock husbandry 
and aquaculture. 

In addition to the stimulation of somatic growth for increasing the gross production 
of animal husbandry and aquaculture, transgenic technology also has many other potential 

20 applications. First of all, transgenic animals can be used as bioreactors to produce 
commercially useful compounds by expression of a useful foreign gene in milk or in blood. 
Many pharmaceutically useful protein factors have been expressed in this way. For 
example, human al-antitrypsin, which is commonly used to treat emphysema, has been 
expressed at a concentration as high as 35 mg/ml (10% of milk proteins) in the milk of 

25 transgenic sheep (Wright et al., 1991). Similarly, the transgenic technique can also be used 
to improve the nutritional value of milk by selectively increasing the levels of certain 
valuable proteins such as caseins and by supplementing certain new and useful proteins 
such as lysozyme for antimicrobial activity (Maga and Murray, 1995). Second, transgenic 
mice have been widely used in medical research, particularly in the generation of 

30 transgenic animal models for human disease studies (Lathe and Mullins, 1993). More 
recently, it has been proposed to use transgenic pigs as organ donors for 
xenotransplantation by expressing human regulators of complement activation to prevent 
hyperacute rejection during organ transplantation (Cozzi and White, 1995). The 



fish. 
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development of disease resistant animals has also been tested in transgenic mice (e.g. Chen 
etaL, 1988). 

_Fish are also an intensive research subject of transgenic studies. There are many 
ways of introducing a foreign gene into fish, including: microinjection (e.g. Zhu et al., 
1985; Du et al., 1992), electroporation (Powers et al., 1992), sperm-mediated gene transfer 
(Khoo et al., 1992; Sin et al., 1993), gene bombardment or gene gun (Zelemin et al., 1991), 
liposome-mediated gene transfer (Szelei et al., 1994), and the direct injection of DNA into 
muscle tissue (Xu et al., 1999). The first transgenic fish report was published by Zhu et al. 
(1985) using a chimeric gene construct consisting of a mouse metallothionein gene 
promoter and a human growth hormone gene. Most of the early transgenic fish studies 
have concentrated on growth hormone gene transfer with an aim of generating fast growing 
"superfish". A majority of early attempts used heterologous growth hormone genes and 
promoters and failed to produce gigantic superfish (e.g. Chourrout et aL, 1986; Penman et 
al., 1990; Brem et al., 1988; Gross et al., 1992). But enhanced growth of transgenic fish has 
been demonstrated in several fish species including Atlantic salmon, several species of 
Pacific salmons, and loach (e.g. Du et al., 1992; Delvin et al., 1994, 1995; Tsai et al., 
1995). 

The zebrafish, Danio rerio, is a new model organism for vertebrate developmental 
biology. As an experimental model, the zebrafish offers several major advantages such as 
easy availability of eggs and embryos, tissue clarity throughout embryogenesis, external 
development, short generation time and easy maintenance of both the adult and the young. 
Transgenic zebrafish have been used as an experimental tool in zebrafish developmental 
biology. However, despite the fact that the first transgenic zebrafish was reported a decade 
ago (Stuart et aL, 1988), most transgenic zebrafish work conducted so far used 
heterologous gene promoters or viral gene promoters: e.g. viral promoters from SV40 
(simian virus 40) and RSV (Rous sarcoma virus) (Stuart et al., 1988, 1990; Bayer and 
Campos-Ortega, 1992), a carp actin promoter (Liu et al., 1990), and mouse homeobox gene 
promoters (Westerfield et al., 1992). As a result, the expression pattern of a transgene in 
many cases is variable and unpredictable. 

GFP (green fluorescent protein) was isolated from a jelly fish, Aqueous victoria. 
The wild type GFP emits green fluorescence at a wavelength of 508 nm upon stimulation 
with ultraviolet light (395 nm). The primary structure of GFP has been elucidated by 
cloning of its cDNA and genomic DNA (Prasher et aL, 1992). A modified GFP, also called 
EGFP (Enhanced Green Fluorescent Protein) has been generated artificially and it contains 
mutations that allow the protein to emit a stronger green light and its coding sequence has 
also been optimized for higher expression in mammalian cells based on preferable human 
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codons. As a result, EGFP fluorescence is about 40 times stronger than the wild type GFP 
in mammalian cells (Yang et al., 1996). GFP (including EGFP) has become a popular tool 
in cell biology and transgenic research. By fusing GFP with a tested protein, the GFP 
fusion-protein can be used as an indicator of the subcellular location of the tested protein 
(Wang and Hazelrigg, 1994) . By transformation of cells with a functional GFP gene, the 
GFP can be used as a marker to identify expressing cells (Chalfie et al., 1994). Thus, the 
GFP gene has become an increasingly popular reporter gene for transgenic research as GFP 
can be easily detected by a non-invasive approach. 

The GFP gene (including EGFP gene) has also been introduced into zebrafish in 
several previous reports by using various gene promoters, including Xenopus elongation 

factor la enhancer-promoter (Amsterdam et al., 1995, 1996), rat myosin light-chain 
enhancer (Moss et al. s 1996), zebrafish GATA-1 and GATA-3 promoters (Meng et al., 1997; 
Long et aL, 1997), zebrafish a- and f}-actin promoters (Higashijima et al., 1997), and 
tilapia insulin-like growth factor I promoter (Chen et al., 1998). All of these transgenic 
experiments aim at either developing a GFP transgenic system for gene expression analysis 
or at testing regulatory DNA elements in gene promoters. 



SUMMARY OF THE INVENTION 

It is a primary objective of the invention to clone fish gene promoters that are 
constitutive (ubiquitous), or that have tissue specificity such as skin specificity or muscle 
specificity or that are inducible by a chemical substance, and to use these promoters to 
develop effective gene constructs for production of transgenic fish. 

It is another objective of the invention to develop fluorescent transgenic ornamental 
fish using these gene constructs. By applying different gene promoters, tissue-specific, 
inducible under different environmental conditions, or ubiquitous, to drive the GFP gene, 
GFP could be expressed in different tissues or ubiquitously. Thus, these transgenic fish 
may be skin fluorescent, muscle fluorescent, ubiquitously fluorescent, or inducibly 
fluorescent. These transgenic fish may be used for ornamental purposes, for monitoring 
environmental pollution, and for basic studies such as recapitulation of gene expression 
programs or monitoring cell lineage and cell migration. These transgenic fish may be used 
for cell transplantation and nuclear transplantation or fish cloning. 

Other objectives, features and advantages of the present invention will become 
apparent from the detailed description which follows, or may be learned by practice of the 
invention. 
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Four zebrafish gene promoters of different characteristics were isolated and four 
chimeric gene constructs containing a zebrafish gene promoter and EGFP DNA were 
made: pCK-EGFP, pMCK-EGFP, pMLC2f-EGFP and pARP-EGFP. The first chimeric 
gene-construct, pCK-EGFP, contains a 2.2 kbp polynucleotide comprising a zebrafish 
5 cytokeratin (CK) gene promoter which is specifically or predominantly expressed in skin 
epithelia. The second one, pMCK-EGFP, contains a 1.5 kbp polynucleotide comprising a 
muscle-specific promoter from a zebrafish muscle creatine kinase (MCK) gene and the 
gene is only expressed in the muscle tissue. The third construct, pMLC2f-EGFP contains a 
2.2 kpb polynucleotide comprising a strong skeletal muscle-specific promoter from the fast 
10 skeletal muscle isoform of the myosin light chain 2 (MLC2f) gene and is expressed 
specifically or predominantly in skeletal muscle. The fourth chimeric gene construct, 
pARP-EGFP, contains a strong and ubiquitously expressed promoter from a zebrafish 
acidic ribosomal protein (ARP) gene. These four chimeric gene constructs have been 
introduced into zebrafish at the one cell stage or two cell stage by microinjection. In ail 
15 cases, the GFP expression patterns were consistent with the specificities of the promoters. 
GFP was predominantly expressed in skin epithelia with pCK-EGFP, specifically 
expressed in muscles with pMCK-EGFP, specifically expressed in skeletal muscles with 
pMLC2f-EGFP and ubiquitously expressed in all tissues with pARP-EGFP. 

These chimeric gene constructs are useful to generate green fluorescent transgenic 
20 fish. The GFP transgenic fish emit green fluorescence light under a blue or ultraviolet light 
and this feature makes the genetically engineered fish unique and attractive in the 
ornamental fish market. The fluorescent transgenic fish are also useful for the development 
of a biosensor system and as research models for embryonic studies such as cell lineage, 
cell migration, cell and nuclear transplantation etc. 



25 BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1A-1I are photographs showing expression of CK (Figs. 1A-1C), MCK (Figs. 
ID-IE), ARP (Figs. 1F-1G) and MLC2f (Figs. 1H-1I) mRNAs in zebrafish embryos as 
revealed by whole mount in situ hybridization (detailed description of the procedure can be 
found in Thisse et al., 1993). (Fig. 1A) A 28 hpf (hour postfertilization) embryo hybridized 

30 with a CK antisense riboprobe. (Fig. IB) Enlargement of the mid-part of the embryo 
shown in Fig. 1A. (Fig. 1C) Cross-section of the embryo in Fig. 1A. (Fig. ID) A 30 hpf 
embryo hybridized with an MCK antisense riboprobe. (Fig. IE) Cross-section of the 
embryo in Fig ID. (Fig. IF) A 28 hpf embryo hybridized with an ARP antisense riboprobe. 
(Fig. 1G) Cross-section of the embryo in Fig. IF. Arrows indicate the planes for cross- 

35 sections and the box in panel A indicates the enlarged region shown in panel B. (Fig. 1H) 
Side view of a 22-hpf embryo hybridized with the MLC2f probe. (Fig. II) Transverse 
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section through the trunk of a stained 24-hpf embryo. SC, spinal cord; N, notochord. 

Fig. 2A is a digitized image showing distribution of CK, MCK and ARP mRNAs in 
adultjissues. Total RNAs were prepared from selected adult tissues as indicated at the top 
of each lane and analyzed by Northern blot hybridization (detailed description of the 
5 procedure can be found in Gong et al., 1992). Three identical blots were made from the 
same set of RNAs and hybridized with the CK, MCK and ARP probes, respectively. 

Fig. 2B is a digitized image showing distribution of MLC2f mRNA in adult tissues. 
Total RNAs were prepared from selected adult tissues as indicated at the top of each lane 
and analyzed by Northern blot hybridization (detailed description of the procedure can be 
10 found in Gong et aL, 1992). Two identical blots were made from the same set of RNAs and 
hybridized with the MLC2f probe and a ubiquitously expressed U-actin probe, 
respectively. 

Fig. 3. is a schematic representation of the strategy of promoter cloning. Restriction 
enzyme digested genomic DNA was ligated with a short linker DNA which consists of 
15 Oligo 1 and Oligo 2. Nested PCR reactions were then performed: the first round PCR used 
linker specific primer LI and gene specific primers Gl, where Gl is CK1, MCK1, Ml or 
ARP1 in the described embodiments, and the second round linker specific primer L2 and 
gene specific primer G2, where G2 is CK2, MCK2, M2 or ARP2, respectively in the 
described embodiments. 

20 Fig. 4 is a schematic map of the chimeric gene construct, pCK-EGFP. The 2.2 kb 

zebrafish DNA fragment comprising the CK promoter region is inserted into pEGFP-1 
(Cionetech) at the EcoRI and BamHI site as indicated. In the resulting chimeric DNA 
construct, the EGFP gene is under control of the zebrafish CK promoter. Also shown is the 

kanamycin/neomycin resistance gene (KanWeo 1 ) in the backbone of the original pEGFP-1 
25 plasmid. The total length of the recombinant plasmid pCK-EGFP is 6.4 kb. 

Fig. 5 is a schematic map of the chimeric gene construct, pMCK-EGFP. The 1.5 kb 
zebrafish DNA fragment comprising the MCK promoter region is inserted into pEGFP-1 
(Cionetech) at the EcoRI and BamHI site as indicated. In the resulting chimeric DNA 
construct, the EGFP gene is under control of the zebrafish MCK promoter. Also shown is 
30 the kanamycin/neomycin resistance gene (Ka^/Neo 1 ") in the backbone of the original 
pEGFP-1 plasmid. The total length of the recombinant plasmid pMCK-EGFP is 5.7 kb. 



Fig. 6 is a schematic map of the chimeric gene construct, pARP-EGFP. The 2.2 kb 
zebrafish DNA fragment comprising the ARP promoter/1 st intron region is inserted into 
pEGFP-1 (Cionetech) at the EcoRI and BamHI site as indicated. In the resulting chimeric 
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DNA construct, the EGFP gene is under control of the zebrafish ARP promoter. Also 
shown is the kanamycin/neomycin resistance gene (KanWeo 1 ) in the backbone of the 
original pEGFP-1 plasmid. The total length of the recombinant plasmid pARP-EGFP is 6.4 
kb. - 

5 Fig. 7 is a schematic map of the chimeric gene construct, pMLC2f-EGFP. The 2.0 

kb zebrafish DNA fragment comprising the MLC2f promoter region is inserted into 
pEGFP-1 (Clonetech) at the Hindlll and BamHI site as indicated. In the resulting chimeric 
DNA construct, the EGFP gene is under control of the zebrafish MLC2f promoter. Also 
shown is the kanamycin/neomycin resistance gene (KanVNeo 1 ) in the backbone of the 
10 original pEGFP-1 plasmid. The total length of the recombinant plasmid pMLC2f-EGFP is 
6.2 kb. 

Fig. 8 is a photograph of a typical transgenic zebrafish fry (4 days old) with pCK- 
EGFP, which emits green fluorescence from skin epithelia under a blue light. 

Fig. 9 is a photograph of a typical transgenic zebrafish fry (3 days old) with pMCK- 
15 EGFP, which emits green fluorescence from skeletal muscles under a blue light. 

Fig. 10 is a photograph of a typical transgenic zebrafish fry (2 days old) with 
pARP-EGFP, which emits green fluorescence under a blue light from a variety of cell 
types such as skin epithelia, muscle cells, lens, neural tissues, notochord, circulating blood 
cells and yolk cells. 

Figs. 11A-11B. Photographs of a typical transgenic zebrafish founder with 
pMLC2f-EGFP (Fig. 11 A) and an Fl stable transgenic offspring (Fig. 1 IB). Both pictures 
were taken under an ultraviolet light (365 nm). The green fluorescence can be better 
observed under a blue light with an optimal wavelength of 488 nm. 

Figs. 12A-12C. Examples of high, moderate and low expression of GFP in 
transiently transgenic embryos at 72 hpf. (Fig. 12A) High expression, GFP expression was 
detected in essentially 100% of the muscle fibers in the trunk. (Fig. 12B) Moderate 
expression, GFP expression was detected in several bundles of muscle fibers, usually in the 
mid-trunk region. (Fig. 12C) Low expression, GFP expression occurred in dispersed 
muscle fibers and the number of GFP positive fibers is usually less than 20 per embryo. 

Fig. 13. Deletion analysis of the MLC2f promoter in transient transgenic zebrafish 
embryos. A series of 5' deletions of MLC2f-EGFP constructs containing -20 11 -bp (2-kb), 
-1338-bp, -873-bp, -283-bp, -77-bp and -3-bp of the MLC2f promoter were generated by 
unidirectional deletion using the double-stranded Nested Deletion Kit from Pharmacia 
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based on the manufacturer's instructional manual. Each construct was injected into 
approximately 100 embryos and GFP expression was monitored in the first 72 hours of 
embryonic development. The level of GFP expression was classified based on the 
examples shown in Figs. 12A-12C. Potential E-boxes and MEF2 binding sites, which are 
important for muscle-specific transcription (Schwarz et al., 1993; Olson et al., 1995), are 
indicated on the — 201 1-bp construct. 




DETAILED DESCRIPTION OF THE INVENTION 
Gene Constructs 

To develop successful transgenic fish with a predictable pattern of transgene 
expression, the first step is to make a gene construct suitable for transgenic studies. The 
5 gene construct generally comprises three portions: a gene promoter, a structural gene and 
transcriptional termination signals. The gene promoter would determine where, when and 
under what conditions the structural gene is turned on. The structural gene contains protein 
coding portions that determine the protein to be synthesized and thus the biological 
function. The structural gene might also contain intron sequences which can affect mRNA 

10 stability or which might contain transcription regulatory elements. The transcription 
termination signals consist of two parts: a polyadenylation signal and a transcriptional 
termination signal after the polyadenylation signal. Both are important to terminate the 
transcription of the gene. Among the three portions, selection of a promoter is very 
important for successful transgenic study, and it is preferable to use a homologous 

15 promoter (homologous to the host fish) to ensure accurate gene activation in the transgenic 
host. 

A promoter drives expression "predormnantly" in a tissue if expression is at least 2- 
fold, preferably at least 5-fold higher in that tissue compared to a reference tissue. A 
promoter drives expression "specifically 1 ' in a tissue if the level of expression is at least 5- 
20 fold, preferably at least 10-fold higher, more preferably at least 50-fold higher in that tissue 
than in any other tissue. 

Recombinant DNA Constructs 

Recombinant DNA constructs comprising one or more of the DNA or RNA 
sequences described herein and an additional DNA and/or RNA sequence are also included 
25 within the scope of this invention. These recombinant DNA constructs usually have 
sequences which do not occur in nature or exist in a form that does not occur in nature or 
exist in association with other materials that do not occur in nature. The DNA and/or RNA 
sequences described as constructs or in vectors above are "operably linked" with other 
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DNA and/or RNA sequences. DNA regions are operably linked when they are functionally 
related to each other. For example, DNA for a presequence or secretory leader is operably 
linked to DNA for a polypeptide if it is expressed as part of a preprotein which participates 
in the-secretion of the polypeptide; a promoter is operably linked to a coding sequence if it 
5 controls the transcription of the coding sequence; a ribosome binding site is operably 
linked to a coding sequence if it is positioned so as to permit translation. Generally, 
operably linked means contiguous (or in close proximity to) and, in the case of secretory 
leaders, contiguous and in reading phase. 

The sequences of some of the DNAs, and the corresponding proteins encoded by 
10 the DNA, which are useful in the invention are set forth in the attached Sequence Listing. 

The complete cytokeratin (CK) cDNA sequence is shown in SEQ ID NO:l, and its 
deduced amino acid sequence is shown in SEQ ID NO:2. The binding sites of the gene 
specific primers for promoter amplification, CK1 and CK2, are indicated. The extra 
nucleotides introduced into CK2 for generation of a restriction site are shown as a 
15 miscjeature in the primer sequence SEQ ID NO: 11. A potential polyadenylation signal, 
AATAAA, is indicated in SEQ ID NO:l. 

The complete muscle creatine kinase (MCK) cDNA sequence is shown in SEQ ID 
NO:3, and its deduced amino acid sequence is shown in SEQ ID NO:4. The binding sites 
of the gene specific primers for promoter amplification, MCK1 and MCK2, are indicated. 
20 The extra nucleotides introduced into MCK1 and MCK2 for generation of restriction sites 
are shown as a miscjeature in the primer sequences SEQ ID NOS:12 and 13, respectively. 
A potential polyadenylation signal, AATAAA, is indicated in SEQ ID NO:3. 

The complete fast skeletal muscle isoform of myosin light chain 2 (MLC2f) cDNA 
sequence is shown in SEQ ID NO:20, and its deduced amino acid sequence is shown in 
25 SEQ ID NO:21 . The binding sites of the gene-specific primers for promoter amplification, 
Ml and M2, are indicated. Two potential polyadenylation signals, AATAAA, are shown as 
a misc_feature in SEQ ID NO:20. 

The complete acidic ribosomal protein P0 (ARP) cDNA sequence is shown in SEQ 
ID NO:5, and its deduced amino acid sequence is shown in SEQ ID NO:6. The binding 
30 sites of the gene specific primers for promoter amplification, ARP1 and ARP2, are 
indicated. The extra nucleotides introduced into ARP2 for generation of a restriction site 
are shown as a misc_feature in the primer sequence SEQ ID NO: 15. A potential 
polyadenylation signal, AATAAA, is indicated in SEQ ID NO:5. 




SEQ ID NO:7 shows the complete sequence of the CK promoter region. A putative 
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TATA box is shown, and the 3' nucleotides identical to the 5 1 CK cDNA sequence are 
shown as a misc_feature. The binding site of the second gene specific primer, CK2, is 
shown. The introduced BamHI site is indicated as a misc feature in the primer sequence 
SEQ -ID NO: 11. 

5 SEQ ID NO:8 shows the complete sequence of the MCK promoter region. A 

putative TATA box is shown, and the 3* nucleotides identical to the 5' MCK cDNA 
sequence are shown as a misc feature in SEQ ID NO:8. The binding site of the second 
gene specific primer, MCK2, is shown. The introduced BamHI site is indicated as a 
misc^feature in the primer sequence SEQ ID NO: 13. 

10 SEQ ID NO:22 shows the complete sequence of the MLC2f promoter region. A 

putative TATA box is shown, and the 3* nucleotides identical to the 5' MLC2f cDNA 
sequence are shown as a misc_feature. The binding site of the second gene-specific primer, 
M2, is shown. Potential muscle-specific cis-elements, E-boxes and MEF2 binding sites, are 
also shown. The proximal 1-kb region of the MLC2f promoter was recently published (Xu 

15 etaL, 1999). 

SEQ ID NO: 9 shows the complete sequence of the ARP promoter region including 
the first intron. The first intron is shown, and the 3' nucleotides identical to the 5' ARP 
cDNA sequence are shown as misc_features. No typical TATA box is found. The binding 
site of the second gene specific primer, ARP2, is shown. The introduced BamHI site is 
20 indicated as a miscjfeature in the primer sequence SEQ ID NO:l 5. 

Specifically Exemplified Polypeptides/DNA 

The present invention contemplates use of DNA that codes for various polypeptides 
and other types of DNA to prepare the gene constructs of the present invention. DNA that 
codes for structural proteins, such as fluorescent peptides including GFP, EGFP, BFP, 

25 EBFP, YFP, EYFP, CFP, ECFP and enzymes (such as luciferase, B-galactosidase, 
chloramphenicol acetyitransferase, etc.), and hormones (such as growth hormone etc.), are 
useful in the present invention. More particularly, the DNA may code for polypeptides 
comprising the sequences exemplified in SEQ ID NOS:2, 4, 6 and 21. The present 
invention also contemplates use of particular DNA sequences, including regulatory 

30 sequences, such as promoter sequences shown in SEQ ID NOS: 7, 8, 9 and 22 or portions 
thereof effective as promoters. Finally, the present invention also contemplates the use of 
additional DNA sequences, described generally herein or described in the references cited 
herein, for various purposes. 
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Chimeric Genes 

The present invention also encompasses chimeric genes comprising a promoter 
described herein operatively linked to a heterologous gene. Thus, a chimeric gene can 
comprise a promoter of a zebrafish operatively linked to a zebrafish structural gene other 
5 than that normally found linked to the promoter in the genome. Alternatively, the 
promoter can be operatively linked to a gene that is exogenous to a zebrafish, as 
exemplified by the GFP and other genes specifically exemplified herein. Furthermore, a 
chimeric gene can comprise an exogenous promoter linked to any structural gene not 
normally linked to that promoter in the genome of an organism. 

10 Variants of Specifically Exemplified Polypeptide 

DNA that codes for variants of the specifically exemplified polypeptides are also 
encompassed by the present invention. Possible variants include allelic variants and 
corresponding polypeptides from other organisms, particularly other organisms of the same 
species, genus or family. The variants may have substantially the same characteristics as 

15 the natural polypeptides. The variant polypeptide will possess the primary property of 
concern for the polypeptide. For example, the polypeptide will possess one or more or all 
of the primary physical (e.g., solubility) and/or biological (e.g., enzymatic activity, 
physiologic activity or fluorescence excitation or emission spectrum) properties of the 
reference polypeptide. DNA of the structural genes of the present invention will encode a 

20 protein that produces a fluorescent or chemiluminescent light under conditions appropriate 
to the particular polypeptide in one or more tissues of a fish. Preferred tissues for 
expression are skin, muscle, eye and bone. 

Substitutions, Additions and Deletions 

As possible variants of the above specifically exemplified polypeptides, the 
25 polypeptide may have additional individual amino acids or amino acid sequences inserted 
into the polypeptide in the middle thereof and/or at the N-terminal and/or C-terminal ends 
thereof so long as the polypeptide possesses the desired physical and/or biological 
characteristics. Likewise, some of the amino acids or amino acid sequences may be deleted 
from the polypeptide so long as the polypeptide possesses the desired physical and/or 
30 biochemical characteristics. Amino acid substitutions may also be made in the sequences 
so long as the polypeptide possesses the desired physical and biochemical characteristics. 
DNA coding for these variants can be used to prepare gene constructs of the present 
invention. 
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Sequence Identity 

The variants of polypeptides or polynucleotides contemplated herein should 
possess more than 75% sequence identity (sometimes referred to as homology), preferably 
more than 85% identity, most preferably more than 95% identity, even more preferably 
5 more than 98% identity to the naturally occurring and/or specifically exemplified 
sequences or fragments thereof described herein. To detenriine this homology, two 
sequences are aligned so as to obtain a maximum match using gaps and inserts. 

Two sequences are said to be "identical" if the sequence of residues is the same 
when aligned for maximum correspondence as described below. The term 
10 "complementary" applies to nucleic acid sequences and is used herein to mean that the 
sequence is complementary to all or a portion of a reference polynucleotide sequence. 

Optimal alignment of sequences for comparison can be conducted by the local 
homology algorithm of Smith and Waterman (1981), by the homology alignment method 
of Needleman and Wunsch (1970), by the search for similarity method of Pearson and 
15 Lippman (1988), or the like. Computer implementations of the above algorithms are 
known as part of the Genetics Computer Group (GCG) Wisconsin Genetics Software 
Package (GAP, BESTFIT, BLASTA, FASTA and TFASTA), 575 Science Drive, Madison, 
WI. These programs are preferably run using default values for ail parameters. 

"Percentage of sequence identity" is determined by comparing two optimally 
20 aligned sequences over a comparison window, wherein the portion of the sequence in the 
comparison window may comprise additions or deletions (i.e. "gaps") as compared to the 
reference sequence for optimal alignment of the two sequences being compared. The 
percentage identity is calculated by determining the number of positions at which the 
identical residue occurs in both sequences to yield the number of matched positions, 
25 dividing the number of matched positions by the total number of positions in the window 
and multiplying the result by 100 to yield the percentage of sequence identity. Total 
identity is then determined as the average identity over all of the windows that cover the 
complete query sequence. 

Fragments of Polypeptide 

30 Genes which code for fragments of the full length polypeptides such as proteolytic 

cleavage fragments which contain at least one, and preferably ail, of the above-listed 
physical and/or biological properties are also encompassed by the present invention. 
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DNA and RNA 

The invention encompasses DNA that codes for any one of the above-described 
polypeptides including, but not limited to, those shown in SEQ ID NOS:2, 4, 6 and 21 
including fusion polypeptides, variants and fragments thereof. The sequence of certain 
5 particularly useful cDNAs which encode polypeptides are shown in SEQ ID NOS;l, 3, 5 
and 20, The present invention also includes cDNA as well as genomic DNA containing or 
comprising the requisite nucleotide sequences as well as corresponding RNA and antisense 
sequences. 

Cloned DNA within the scope of the invention also includes allelic variants of the 
10 specific sequences presented in the attached Sequence Listing. An "allelic variant" is a 
sequence that is a variant from that of the exemplified nucleotide sequence, but represents 
the same chromosomal locus in the organism. In addition to those which occur by normal 
genetic variation in a population and perhaps fixed in the population by standard breeding 
methods, allelic variants can be produced by genetic engineering methods. A preferred 
15 allelic variant is one that is found in a naturally occurring organism, including a laboratory 
strain. Allelic variants are either silent or expressed. A silent allele is one that does not 
affect the phenotype of the organism. An expressed allele results in a detectable change in 
the phenotype of the trait represented by the locus. 

A nucleic acid sequence "encodes" or "codes for" a polypeptide if it directs the 
20 expression of the polypeptide referred to. The nucleic acid can be DNA or RNA. Unless 
otherwise specified, a nucleic acid sequence that encodes a polypeptide includes the 
transcribed strand, the hnRNA and the spliced RNA or the DNA representative of the 
mRNA. An "antisense" nucleic acid is one that is complementary to all or part of a strand 
representative of mRNA, including untranslated portions thereof. 

25 Degenerate Sequences 

In accordance with degeneracy of genetic code, it is possible to substitute at least 
one base of the base sequence of a gene by another kind of base without causing the amino 
acid sequence of the polypeptide produced from the gene to be changed. Hence, the DNA 
of the present invention may also have any base sequence that has been changed by 
30 substitution in accordance with degeneracy of genetic code. 

DNA Modification 

The DNA is readily modified by substitution, deletion or insertion of nucleotides, 
thereby resulting in novel DNA sequences encoding the polypeptide or its derivatives. 
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These modified sequences are used to produce mutant polypeptide and to directly express 
the polypeptide. Methods for saturating a particular DNA sequence with random mutations 
and also for making specific site-directed mutations are known in the art; see e.g. 
Sambrook et ah (1989). 

5 Hybridizable Variants 

The DNA molecules useful in accordance with the present invention can comprise a 
nucleotide sequence selected from the group consisting of SEQ ID NOS.rl, 3, 5, 7-20 and 
22-24 or can comprise a nucleotide sequence that hybridizes to a DNA molecule 
comprising the nucleotide sequence of SEQ ID NOS.:l, 3, 5 or 20 under salt and 

10 temperature conditions providing stringency at least as high as that equivalent to 5x SSC 
and 42°C and that codes on expression for a polypeptide that has one or more or all of the 
above-described physical and/or biological properties. The present invention also includes 
polypeptides coded for by these hybridizable variants. The relationship of stringency to 
hybridization and wash conditions and other considerations of hybridization can be found 

15 in Chapters 1 1 and 12 of Sambrook et al (1989). The present invention also encompasses 
functional promoters which hybridize to SEQ ID NOS:7, 8, 9 or 22 under the above- 
described conditions. DNA molecules of the invention will preferably hybridize to 
reference sequences under more stringent conditions allowing the degree of mismatch 
represented by the degrees of sequence identity enumerated above. The present invention 

20 also encompasses functional primers or linker oligonucleotides set forth in SEQ ID 
NOS:10-19 and 23-24 or larger primers comprising these sequences, or sequences which 
hybridize with these sequences under the above-described conditions. The primers usually 
have a length of 10-50 nucleotides, preferably 15-35 nucleotides, more preferably 18-30 
nucleotides. 

25 Vectors 

The invention is further directed to a replicable vector containing cDNA that codes 
for the polypeptide and that is capable of expressing the polypeptide. 

The present invention is also directed to a vector comprising a replicable vector and 
a DNA sequence corresponding to the above described gene inserted into said vector. The 
30 vector may be an integrating or non-integrating vector depending on its intended use and is 
conveniently a plasmid. 

Transformed Cells 

The invention further relates to a transformed cell or microorganism containing 
cDNA or a vector which codes for the polypeptide or a fragment or variant thereof and that 
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is capable of expressing the polypeptide. 

Expression Systems Using Vertebrate Cells 

~~ Interest has been great in vertebrate cells, and propagation of vertebrate cells in 
culture (tissue culture) has become a routine procedure. Examples of vertebrate host cell 
5 lines useful in the present invention preferably include cells from any of the fish described 
herein. Expression vectors for such cells ordinarily include (if necessary) an origin of 
replication, a promoter located upstream from the gene to be expressed, along with a 
ribosome-binding site, RNA splice site (if intron-containing genomic DNA is used or if an 
intron is necessary to optimize expression of a cDNA), a polyadenylation site, and a 
10 transcription termination sequence. 

EXAMPLES 

The following examples are provided by way of illustration only and not by way of 
limitation. Those of skill will readily recognize a variety of noncritical parameters which 
can be changed or modified to yield essentially similar results. 

15 Example I: Isolation of skin-specific, muscle-specific and ubiquitously expressed 



cDNA clones were isolated and sequenced as described by Gong et al. (1997). 
Basically, random cDNA clones were selected from zebrafish embryonic and adult cDNA 
libraries and each clone was partially sequenced by a single sequencing reaction. The 

20 partial sequences were then used to identify the sequenced clones for potential function and 
tissue specificity. Of the distinct clones identified by this approach, four of them were 
selected: for skin specificity (clone A3 9 encoding cytokeratin, CK), for muscle specificity 
(clone E146 encoding muscle creatine kinase, MCK), for skeletal muscle specificity (clone 
A113 encoding the fast skeletal muscle isoform of the myosin light chain 2, MLC2f) and 

25 for ubiquitous expression (clone A150 encoding acidic ribosomal protein P0, ARP), 
respectively. 

The four cDNA clones were sequenced, and their complete cDNA sequences with 
deduced amino acid sequences are shown in SEQ ID NOS:l, 3, 5, and 20 respectively. A39 
encodes a type II basic cytokeratin and its closest homolog in mammals is cytokeratin 8 
30 (65-68% amino acid identity). El 46 codes for the zebrafish MCK and its amino acid 
sequence shares ~87% identity with mammalian MCKs. A113 encodes the fast skeletal 
muscle isoform of the myosin light chain 2. The deduced amino acid sequence of this gene 
is highly homologous to other vertebrate fast skeletal muscle MLC2f proteins (over 80% 



zebrafish cDNA clones. 
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amino acid identity). The amino acid sequence of zebrafish ARP deduced from the A 150 
clone is 87-89% identical to those of mammalian ARPs. 

To demonstrate their expression patterns, whole mount in situ hybridization (Thisse 
et al., 1993) was performed for developing embryos and Northern blot analyses (Gong et 
5 al., 1992) were carried out for selected adult tissues and for developing embryos. 

As indicated by whole mount in situ hybridization, cytokeratin mRNA was 
specifically expressed in the embryonic surface (Figs. 1A-1C ) and cross section of in situ 
hybridized embryos confirmed that the expression was only in skin epithelia (Fig. 1C). 
Ontogenetically, the cytokeratin mRNA appeared before 4 hours post-fertilization (hpf) 

10 and it is likely that the transcription of the cytokeratin gene starts at mid-blastula transition 
when the zygotic genome is activated. By in situ hybridization, a clear cytokeratin mRNA 
signal was detected in highly flattened cells of the superficial layer in blastula and the 
expression remained in the superficial layer which eventually developed into skin epithelia 
including the yolk sac. In adult tissues, cytokeratin mRNA was predominantly detected in 

15 the skin and also weakly in several other tissues including the eye, gill, intestine and 
muscle, but not in the liver and ovary (Fig. 2). Therefore, the cytokeratin mRNA is 
predominantly, if not specifically, expressed in skin cells. 

MCK mRNA was first detected in the first few anterior somites in 10 somite stage 
embryos (14 hpf) and at later stages the expression is specifically in skeletal muscle (Fig. 
20 ID) and in heart (data not shown). When the stained embryos are cross-sectioned, the 
MCK mRNA signal was found exclusively in the trunk skeletal muscles (Fig. IE). In adult 
tissues, MCK mRNA was detected exclusively in the skeletal muscle (Fig. 2). 

MLC2f mRNA was specifically expressed in fast skeletal muscle in developing 
zebrafish embryos (Figs. 1H- II). To examine the tissue distribution of MLC2f mRNA, total 
25 RNAs were prepared from several adult tissues including heart, brain, eyes, gills, intestine, 
liver, skeletal muscle, ovary, skin, and testis. MLC2f mRNA was only detected in the 
skeletal muscle by Northern analysis; while a-actin mRNA was detected ubiquitously in 
the same set of RNAs, confirming the validity of the assay (Fig. 2B). 

ARP mRNA was expressed ubiquitously and it is presumably a maternal mRNA 
since it is present in the ovary as well as in embryos at one cell stage. In in situ 
hybridization experiments, an intense hybridization signal was detected in most tissues. An 
example of a hybridized embryo at 28 hpf is shown in Fig. IF. In adults, ARP mRNA was 
abundantly expressed in all tissues examined except for the brain where a relatively weak 
signal was detected (Fig. 2A), These observations confirmed that the ARP mRNA is 
35 expressed ubiquitously. 
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Example II: Isolation of zebrafish gene promoters 

Four zebrafish gene promoters were isolated by a linker-mediated PCR method as 
described by Liao et aL> (1997) and as exemplified by the diagrams in Fig. 3. The whole 
procedure includes the following steps: 1) designing of gene specific primers; 2) isolation 
5 of zebrafish genomic DNA; 3) digestion of genomic DNA by a restriction enzyme; 4) 
ligation of a short linker DNA to the digested genomic DNA; 5) PCR amplification of the 
promoter region; and 6) DNA sequencing to confirm the cloned DNA fragment. The 
following is the detailed description of these steps. 

1. Designing of gene specific primers 

10 Gene specific PCR primers were designed based on the 5' end of the four cDNA 

sequences and the regions used for designing the primers are shown in SEQ ID NOS: 1,3, 
5 and 20. 

The two cytokeratin gene specific primers are: 
CK1 (SEQ ID NO: 10) 

15 CK2 (SEQ ID NO:l 1), where the first six nucleotides are for creation of an EcoRJ site to 
facilitate cloning. 

The two muscle creatine kinase gene specific primers are: 

MCK1 (SEQ ID NO: 12), where the first five nucleotides are for creation of an EcoRI site 
to facilitate cloning. 

20 MCK2 (SEQ ID NO: 13), where the first three nucleotides are for creation of an EcoRI site 
to facilitate cloning. 

The two fast skeletal muscle isoform of myosin light chain 2 gene specific primers are: 
Ml (SEQ ID NO:23) 
M2(SEQ ID NO:24) 

25 The two acidic ribosomal protein P0 gene specific primers are: 



ARP1 (SEQ ID NO: 14) 
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ARP2 (SEQ ID NO: 15), where the first six nucleotides are for creation of an EcoRI site to 
facilitate cloning. 

2. ~" Isolation of zebrafish genomic DNA 

Genomic DNA was isolated from a single individual fish by a standard method (Sambrook 
et al. 9 1989). Generally, an adult fish was quickly frozen in liquid nitrogen and ground into 
powder. The ground tissue was then transferred to an extraction buffer (10 mM Tris, pH 8, 
0.1 M EDTA, 20 u-g/ml RNase A and 0.5% SDS) and incubated at 37°C for 1 hour. 
Proteinase K was added to a final concentration of 1 00 p.g/ml and gently mixed until the 
mixture appeared viscous, followed by incubation at 50°C for 3 hours with periodical 
swirling. The genomic DNA was gently extracted three times by phenol equilibrated with 
Tris-HCl (pH 8), precipitated by adding 0.1 volume of 3 M NaOAc and 2.5 volumes of 
ethanol, and collected by swirling on a glass rod, then rinsed in 70% ethanol. 

3. Digestion of genomic DNA by a restriction enzyme 

Genomic DNA was digested with the selected restriction enzymes. Generally, 500 
units of restriction enzyme were used to digest 50 p,g of genomic DNA overnight at the 
optimal enzyme reaction temperature (usually at 37°C), 

4. Ligation of a short linker DNA to the digested genomic DNA 

The linker DNA was assembled by annealing equal moles of the two linker 
oligonucleotides, Oiigol (SEQ ID NO.T6) and Oligo 2 (SEQ ID NO: 17). Oligo 2 was 
phosphorylated by T4 polynucleotide kinase prior to annealing. Restriction enzyme 
digested genomic DNA was filled-in or trimmed with T4 DNA polymerase, if necessary, 
and ligated with the linker DNA. Ligation was performed with 1 u.g of digested genomic 
DNA and 0.5 \xg of linker DNA in a 20 u.1 reaction containing 10 units of T4 DNA ligase 

at 4°C overnight. 

5. PCR amplification of promoter region 

PCR was performed with Advantage Tth Polymerase Mix (Clontech). The first 
round of PCR was performed using a linker specific primer LI (SEQ ID NO: 18) and a 
gene specific primer Gl (CK1, MCK1, Ml or ARP1), Each reaction (50 jllI) contains 5 u.1 
of lOx Tth PCR reaction buffer (1X= 15 mM KOAc, 40 mM Tris, pH 9,3), 2.2 uJ of 25 
mM Mg(OAc)2, 5 ul of 2 mM dNTP, 1 ul of LI (0.2 u.g/u.lX 1 ul of Gl (0.2 jig/pl) , 33.8 
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|al of H2O, and 1 \x\ (50 ng) of Linker ligated genomic DNA and 1 ul of 50x Tth 
polymerase mix (Clontech). The cycling conditions were as follows: 94°C/1 min, 35 
cycles of 94°C/30 sec and 68°C/6 min, and finally 68°C/8 min. After the primary round of 
PCR^was completed, the products were diluted 100 fold. One ul of diluted PCR product 
5 was used as template for the second round of PCR (nested PCR) with a second linker 
specific primer L2 (SEQ ID NO: 19) and a second gene specific primer G2 (CK2, MCK2, 
M2 or ARP2), as described for the primary PCR but with the following modification: 
94°C/1 min, 25 cycles of 94°C/30 sec and 68°C/6 min, and finally 68°C/8 min. Both the 
primary and secondary PCR products were analyzed on a 1% agarose gel. 

10 6. DNA sequencing to confirm the cloned DNA fragment 

PCR products were purified from the agarose gel following electrophoresis and 
cloned into a TA vector, pT7Blue™ (Novogen). DNA sequencing was performed by 
dideoxynucleotide chain termination method using a T7 Sequencing Kit purchased from 
Pharmacia. Complete sequences of these promoter regions were obtained by automatic 
15 sequencing using a dRhodamine Terminator Cycle Sequencing Ready Reaction Kit 
(Perkin-Elmer) and an ABI 377 automatic sequencing machine. 

The isolated cytokeratin DNA fragment comprising the gene promoter is 2.2 kb. In 
the 3' proximal region immediately upstream of a portion identical to the 3' part of the CK 
cDNA sequence, there is a putative TATA box perfectly matching to a consensus TATA 
20 box sequence. The 164 bp of the 3' region is identical to the 5' UTR (untranslated region) 
of the cytokeratin cDNA. Thus, the isolated fragment was indeed derived from the same 
gene as the cytokeratin cDNA clone (SEQ ID NO:7). Similarly, a 1.5 kb 5' flanking region 
was isolated from the muscle creatine kinase gene, a putative TATA box was also found in 
its 3' proximal region and the 3' region is identical to the 5' portion of the MCK cDNA 

25 clone (SEQ ID NO:8). For MLC2f, a 2 kb region was isolated from the fast skeletal muscle 
isoform of myosin light chain 2 gene and sequenced completely. The promoter sequence 
for MLC2f is shown in SEQ ID NO:22. The sequence immediately upstream of the gene 
specific primer M2 is identical to the 5' UTR of the MLC2f cDNA clone; thus, the 
amplified DNA fragments are indeed derived from the MLC2f gene. A perfect TATA box 

30 was found 30 nucleotides upstream of the transcription start site, which was defined by a 
primer extension experiment based on Sambrook et al. (1989). In the 2-kb region 
comprising the promoter, six E-boxes (CANNTG) and six potential MEF2 binding sites 
[C/T)TA(T/A)4TA(A/G)] were found and are indicated in SEQ ID NO:22. Both of these 
cis-element classes are important for muscle specific gene transcription (Schwarz et aL, 

35 1993; Olson et aL, 1995). A 2.2 kb fragment was amplified for the ARP gene. By 
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alignment of its sequence with the ARP cDNA clone, a 1.3 kb intron was found in the 5' 
UTR (SEQ ID NO:9). As a result, the isolated ARP promoter is within a DNA fragment 
ahout 0.8 kb long. 



Example III: Generation of green fluorescent transgenic fish 

5 The isolated zebrafish gene promoters were inserted into the plasmid pEGFP-1 

(Clonetech), which contains an EGFP structural gene whose codons have been optimized 
according to preferable human codons. Three promoter fragments were inserted into 
pEGFP-1 at the EcoRl and BamHI site and the resulting recombinant plasmids were named 
pCK-EGFP (Fig. 4), pMCK-EGFP (Fig. 5), and pARP-EGFP, respectively (Fig. 6). The 
10 promoter fragment for the MLC2f gene was inserted into the Hind III and Bam HI sites of 
the plasmid pEGFP-1 and the resulting chimeric DNA construct, pMLC2f-EGFP, is 
diagramed in Fig. 7. 

Linearized plasmid DNAs at a concentrations of 500 jxg/ml (for pCK-EGFP and 
pMCK-EGFP) and 100 fig/ml (for pMLC2f-EGFP) in 0.1 M Tris-HCl (pH 7.6)/0.25% 

15 phenol red were injected into the cytoplasm of 1- or 2-cell stage embryos. Because of a 
high mortality rate, pARP-EGFP was injected at a lower concentration (50 u.g/ml). Each 
embryo received 300-500 pi of DNA. The injected embryos were reared in autoclaved 
Holtfreter's solution (0.35% NaCi, 0,01% KC1 and 0.01% CaCl2) supplemented with 1 
p.g/ml of methylene blue. Expression of GFP was observed and photographed under a 

20 ZEISS Axiovert 25 fluorescence microscope. 

When zebrafish embryos received pCK-EGFP, GFP expression started about 4 
hours after injection, which corresponds to the stage of -30% epiboly. About 55% of the 
injected embryos expressed GFP at this stage. The early expression was always in the 
superficial layer of cells, mimicking endogenous expression of the CK gene as observed by 
25 in situ hybridization. At later stages, in all GFP-expressing fish, GFP was found 
predominantly in skin epithelia. A typical pCK-EGFP transgenic zebrafish fry at 4 days old 
is shown in Fig. 8. 

Under the MCK promoter, no GFP expression was observed in early embryos 
before muscle cells become differentiated. By 24 hpf, about 12% of surviving embryos 
30 expressed GFP strongly in muscle cells and these GFP-positive embryos remain GFP- 
positive after hatching. The GFP expression was always found in many bundles of muscle 
fibers, mainly in the mid-trunk region and no expression was ever found in other types of 
cells. A typical pMCK-EGFP transgenic zebrafish fry (3 days old) is shown in Fig. 9. 

Expression of pARP-EGFP was first observed 4 hours after injection at the 30%) 
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epiboly stage. The timing of expression is similar to that of pCK-EGFP-injected embryos. 
However, unlike the pCK-EGFP transgenic embryos, the GFP expression under the ARP 
promoter occurred not only in the superficial layer of cells but also in deep layers of cells. 
In some batches of injected embryos, almost 100% of the injected embryos expressed 
5 initially. At later stages when some embryonic cells become overtly differentiated, it was 
found that the GFP expression occurred essentially in all different types of cells such as 
skin epithelia, muscle cells, lens, neural tissues, notochord, circulating blood cells and yolk 
cells (Fig. 10). 

Under the MLC2f promoter, nearly 60% of the embryos expressed GFP. The 
10 earliest GFP expression started in trunk skeletal muscles about 19 hours after injection, 
which corresponds to the stage of 20-sornite. Later, the GFP expression also occurred in 
head skeletal muscles including eye muscles, jaw muscles, gill muscles etc. 

Transgenic founder zebrafish containing pMLC2f-EGFP emit a strong green 
fluorescent light under a blue or ultraviolet light (Fig. 1 1 A). When the transgenic founders 
15 were crossed with wild-type fish, transgenic offspring were obtained that also displayed 
strong green fluorescence (Fig. 11B). The level of GFP expression is so high in the 
transgenic founders and offspring that green fluorescence can be observed when the fish 
are exposed to sunlight. 

To identify the DNA elements conferring the strong promoter activity in skeletal 
20 muscles, deletion analysis of the 2-kb DNA fragment comprising the promoter was 
performed. Several deletion constructs, which contain 5' deletions of the MLC2f promoter 
upstream of the EGFP gene, were injected into the zebrafish embryos and the transient 
expression of GFP in early embryos (19-72 hpf) was compared. To facilitate the 
quantitative analysis of GFP expression, we define the level of expression as follows (Figs. 
25 12A-12C): 

Strong expression: GFP expression was detected in essentially 100% muscle fibers 
in the trunk. 



Moderate expression: GFP expression was detected in several bundles of muscle 
fibers, usually in the mid-trunk region. 

30 Weak expression: GFP expression occurred in dispersed muscle fibers and the 

number of GFP positive fibers is usually less than 20 per embryo. 

As summarized in Fig. 13, deletion up to -283 bp maintained the GFP expression 
in skeletal muscles in 100% of the expressing embryos; however, the level of GFP 
expression from these deletion constructs varies greatly. Strong expression drops from 



WO 00/49150 



PCT/SG99/00079 



-21 - 

23% to 0% from the 2-kb (-2011 bp) promoter to the — 283-bp promoter. Thus, only two 
constructs (-201 1 bp and -1338 bp) are capable of maintaining the high level of expression 
and the highest expression was obtained only with the 2-kb promoter, indicating the 
importance of the promoter region of -1338 bp to -2011 bp for conferring the highest 
5 promoter activity. 

The expression of GFP using pMLC2f-EGFP is much higher than that obtained 
using the pMCK-EGFP that contains a 1.5 kb of zebrafish MCK promoter. By the same 
assay in transient transgenic zebrafish embryos, only about 12% of the embryos injected 
with pMCK-EGFP expressed GFP. Among the expressing embryos, no strong expression 
10 was observed, and 70% and 30% showed moderate and weak expression, respectively. In 
comparison, about 60% of the embryos injected with pMLC2f-EGFP expressed GFP and 
23%, 37% and 40% showed strong, moderate and weak expression, respectively. 

Example IV: Potential applications of fluorescent transgenic fish 

The fluorescent transgenic fish have use as ornamental fish in the market. Stably 
15 transgenic lines can be developed by breeding a GFP transgenic individual with a wild type 
fish or another transgenic fish. By isolation of more zebrafish gene promoters, such as eye- 
specific, bone-specific, tail-specific etc., and/or by classical breeding of these transgenic 
zebrafish, more varieties of fluorescent transgenic zebrafish can be produced. Previously, 
we have reported isolation of over 200 distinct zebrafish cDNA clones homologous to 
20 known genes (Gong et aL, 1997). These isolated clones code for proteins in a variety of 
tissues and some of them are inducible by heat-shock, heavy metals, or hormones such as 
estrogens. By using the method of PCR amplification using gene- specific primers 
designed from the nucleotide sequences of these cDNAs, and the linker-specific primers 
described herein, the promoters of the genes represented by the cDNAs of Gong et al. can 
25 be used in the present invention. Thus, other tissue-specific promoters, hormone- inducible 
promoters, heavy-metal inducible promoters and the like from zebrafish can be isolated 
and used to make fluorescent zebrafish (or other fish species) that express a GFP or variant 
thereof, in response to the relevant compound. 

Multiple color fluorescent fish may be generated by the same technique as blue 
30 fluorescent protein (BFP) gene, yellow fluorescent protein (YFP) gene and cyan 
fluorescent protein (CFP) gene are available from Clonetech. For example, a transgenic 
fish with GFP under an eye-specific promoter, BFP under a skin-specific promoter, and 
YFP under a muscle- specific promoter will show the following multiple fluorescent colors: 
green eyes, blue skin and yellow muscle. By recombining different tissue specific 
35 promoters and fluorescent protein genes, more varieties of transgenic fish of different 
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fluorescent color patterns will be created. By expression of two or more different 
fluorescent proteins in the same tissue, an intermediate color may be created. For example, 
expression of both GFP and BFP under a skin-specific promoter, a dark-green skin color 
may be created. 

5 By using a heavy metal- (such as cadmium, cobalt, chromium) inducible or 

hormone- (such as estrogen, androgen or other steroid hormone) inducible promoter, a 
biosensor system may be developed for monitoring environmental pollution and for 
evaluating water quality for human consumption and aquacultural uses. In such a biosensor 
system, the transgenic fish will glow with a green fluorescence (or other color depending 

10 on the fluorescence protein gene used) when pollutants such as heavy metals and estrogens 
(or their derivatives) reach a threshold concentration in an aquatic environment. Such a 
biosensor system has advantages over classical analytical methods because it is rapid, 
visualizable, and capable of identifying specific compounds directly in complex mixture 
found in an aquatic environment, and is portable or less instrument dependent. Moreover, 

15 the biosensor system also provides direct information on biotoxicity and it is biodegradable 
and regenerative. 

Environmental monitoring of several substances can be accomplished by either 
creating one transgenic fish having genes encoding different colored fluorescent proteins 
driven by promoters responsive to each substance. Then the particular colors exhibited the 
20 fish in an environment can be observed. Alternatively, a number of fish can be 
transformed with individual vectors, then the fish can be combined into a population for 
monitoring an environment and the colors expressed by each fish observed. 

In addition, the fluorescent transgenic fish should also be valuable in the market for 
scientific research tools because they can be used for embryonic studies such as tracing cell 
25 lineage and cell migration. Cells from transgenic fish expressing GFP can also be used as 
cellular and genetic markers in cell transplantation and nuclear transplantation 
experiments. 

The chimeric gene constructs demonstrated successfully in zebrafish in the present 
invention should also be applicable to other fish species such as medaka, goldfish, carp 

30 including koi, loach, tilapia, glassfish, catfish, angel fish, discus, eel, tetra, goby, gourami, 
guppy, Xiphophorus (swordtail), hatchet fish, Molly fish, pangasius, etc. The promoters 
described herein can be used directly in these fish species. Alternatively, the homologous 
gene promoters from other fish species can be isolated by the method described in this 
invention. For example, the isolated and characterized zebrafish cDNA clones and 

35 promoters described in this invention can be used as molecular probes to screen for 
homologous promoters in other fish species by molecular hybridization or by PCR. 
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Alternatively, one can first isolate the zebrafish cDNA and promoters based on the 
sequences presented in SEQ ID NOS:l, 3, 5, 7, 8, 9, 20 and 22 or using data from other 
sequences of cDNAs disclosed by Gong et al. 1997, by PCR and then use the zebrafish 
gene "fragments to obtain homologous genes from other fish species by the methods 
mentioned above. 

In addition, a strong muscle-specific promoter such as MLC2f is valuable to direct 
a gene to be expressed in muscle tissues for generation of other beneficial transgenic fish. 
For example, transgenic expression of a growth hormone gene under the muscle-specific 
promoter may stimulate somatic growth of transgenic fish. Such DNA can be introduced 
either by microinjection, electroporation, or sperm carrier to generate germ-line transgenic 
fish, or by direct injection of naked DNA into skeletal muscles (Xu et al., 1999) or into 
other tissues or cavities, or by a biolistic method (gene bombardment or gene gun) 
(Gomez-Chiarri et al., 1996). 
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SEGUENCE LISTING 

<110> GONG, Zhiyuan 
LAM, Toong Jin 
JU, Bensheng 
XU, Yanfei 
— HE, Jiangyan 
YAN, Tie 

<120> CHIMERIC GENE CONSTRUCTS FOR GENERATION OF FLUORESCENT 
TRANSGENIC ORNAMENTAL FISH 

<130> 1781-0163P 

<140> 
<141> 

<160> 24 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 2480 

<212> DNA 

<213> Danio rerio 

<220> 

<221> CDS 

<222> (90) . . (1586) 

<220> 

<221> primer__bind 

<222> (66) . . (85) 

<223> CK2 

<220> 

<221> primer_bind 

<222> (97) . . (120) 

<223> CK1 

<220> 

<221> polyA_signal 
<222> (2446) . . (2451) 

<400> 1 

ctctcctttg tgagcaacct cctccactca ctcctctctc agagagcact ctcgtacctc 60 
cttctcagca actcaaagac acaggcatc atg tea acc agg tct ate tct tac 113 

Met Ser Thr Arg Ser lie Ser Tyr 
1 5 



tec age ggt ggc tec ate agg agg ggc tac acc age cag tea gec tat 
Ser Ser Gly Gly Ser lie Arg Arg Gly Tyr Thr Ser Gin Ser Ala Tyr 
10 15 20 



161 



gca gta cct gee ggc tct acc agg atg age tea gtg acc agt gtc agg 209 
Ala Val Pro Ala Gly Ser Thr Arg Met Ser Ser Val Thr Ser Val Arg 
25 30 35 40 



aga tct ggt gtg ggt gec age cca ggc ttc ggt gec ggt ggc age tac 
Arg Ser Gly Val Gly Ala Ser Pro Gly Phe Gly Ala Gly Gly Ser Tyr 
45 50 55 



257 



age ttt age age age age atg ggt gga ggc tat gga agt ggt ctt ggt 
Ser Phe Ser Ser Ser Ser Met Gly Gly Gly Tyr Gly Ser Gly Leu Gly 
60 65 70 



305 



t 
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gga ggt etc ggt ggg ggc atg ggc ttt cgt tgc ggg ctt cct ate aca 353 

Gly Gly Leu Gly Gly Gly Met Gly Phe Arg Cys Gly Leu Pro lie Thr 
75 80 85 ~ 

get gta act gtc aac cag aac ctg ttg gec ccc tta aac ctg gaa ate 401 
Ala_Val Thr Val Asn Gin Asn Leu Leu Ala Pro Leu Asn Leu Glu lie 
90 95 100 

gac ccc aca att caa get gtc cgc act tea gag aaa gag cag att aag 4 49 
Asp Pro Thr He Gin Ala Val Arg Thr Ser Glu Lys Glu Gin He Lys 
105 110 115 120 

acc ttc aac aac cgc ttc get ttc etc ate gac aaa gtg cgc ttc ctg 497 
Thr Phe Asn Asn Arg Phe Ala Phe Leu He Asp Lys Val Arg Phe Leu 
125 130 135 

gaa cag cag aac aag atg ctt gag acc aaa tgg agt ctt etc caa gaa 54 5 
Glu Gin Gin Asn Lys Met Leu Glu Thr Lys Trp Ser Leu Leu Gin Glu 
140 145 150 

cag aca acc aca cgt tec aac ate gat gec atg ttt gag gca tac ate 593 
Gin Thr Thr Thr Arg Ser Asn He Asp Ala Met Phe Glu Ala Tyr He 
155 160 165 

tct aac ctg cgc aga cag etc gat gga ctg gga aat gag aag atg aag 641 
Ser Asn Leu Arg Arg Gin Leu Asp Gly Leu Gly Asn Glu Lys Met Lys 
170 175 180 

ctg gag gga gag ctg aag aac atg cag ggc ctg gtt gag gac ttc aag 689 
Leu Glu Gly Glu Leu Lys Asn Met Gin Gly Leu Val Glu Asp Phe Lys 
185 190 195 200 

aac aag tac gag gat gag ate aac aag cgt get tec gta gag aat gag 737 
Asn Lys Tyr Glu Asp Glu lie Asn Lys Arg Ala Ser Val Glu Asn Glu 
205 210 215 

ttt gtc ctg etc aag aag gat gtt gat gca gec tac atg aac aag gtt 785 
Phe Val Leu Leu Lys Lys Asp Val Asp Ala Ala Tyr Met Asn Lys Val 
220 225 230 

gag ctt gaa gec aag gtt gat get ctt cag gat gag ate aac ttc etc 833 
Glu Leu Glu Ala Lys Val Asp Ala Leu Gin Asp Glu He Asn Phe Leu 
235 240 245 

agg gca gtc tac gag get gaa etc egg gag etc cag tct cag ate aag 881 
Arg Ala Val Tyr Glu Ala Glu Leu Arg Glu Leu Gin Ser Gin He Lys 
250 255 260 

gac aca tct gtt gtt gta gaa atg gac aac age aga aac ctg gat atg 929 
Asp Thr Ser Val Val Val Glu Met Asp Asn Ser Arg Asn Leu Asp Met 
265 270 275 280 

gac tec ate gtg get gaa gtt cgc get cag tat gaa gac ate gec aac 977 
Asp Ser He Val Ala Glu Val Arg Ala Gin Tyr Glu Asp He Ala Asn 
285 290 295 

cgc age cgt gec gag gca gag age tgg tac aaa cag aag ttt gag gag 1025 
Arg Ser Arg Ala Glu Ala Glu Ser Trp Tyr Lys Gin Lys Phe Glu Glu 
300 305 310 

atg cag age acc get ggt cag tat ggt gat gac etc cgc tea aca aag 1073 
Met Gin Ser Thr Ala Gly Gin Tyr Gly Asp Asp Leu Arg Ser Thr Lys 
315 320 325 



get gag att get gaa etc aac cgc atg ate gec cgc ctg cag aac gag 



1121 
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Ala Glu lie Ala Glu Leu Asn Arg Met lie Ala Arg Leu Gin Asn Glu 
330 335 340 

ate gat get gtc aag gca cag cgt gec aac ttg gag get cag att get 1169 
lie Asp Ala Val Lys Ala Gin Arg Ala Asn Leu Glu Ala Gin lie Ala 
345 — 350 355 360 

gag get gaa gag cgt gga gag ctg gca gtg aag gat gee aag etc cgc 1217 
Glu Ala Glu Glu Arg Gly Glu Leu Ala Val Lys Asp Ala Lys Leu Arg 
365 370 375 

ate agg gag ctg gag gaa get ctt cag agg gee aag caa gac atg gee 1265 
lie Arg Glu Leu Glu Glu Ala Leu Gin Arg Ala Lys Gin Asp Met Ala 
380 385 390 

cgc cag gtc cgc gag tac cag gag etc atg aac gtc aaa ttg get ctg 1313 
Arg Gin Val Arg Glu Tyr Gin Glu Leu Met Asn Val Lys Leu Ala Leu 
395 400 405 

gac att gag ate gee acc tac agg aaa ctg ttg gaa gga gag gag age 13 61 
Asp lie Glu lie Ala Thr Tyr Arg Lys Leu Leu Glu Gly Glu Glu Ser 
410 415 420 

aga ctg tec age ggt gga get caa get acc att cat gtt cag cag acc 14 09 
Arg Leu Ser Ser Gly Gly Ala Gin Ala Thr lie His Val Gin Gin Thr 
425 430 435 440 

tec gga ggt gtt tea tct ggt tat ggt ggt age ggc tct ggt ttc ggc 14 57 
Ser Gly Gly Val Ser Ser Gly Tyr Gly Gly Ser Gly Ser Gly Phe Gly 
445 450 455 

tac age agt ggc ttc age agt ggt ggg tea gga tac ggt agt gga tea 1505 
Tyr Ser Ser Gly Phe Ser Ser Gly Gly Ser Gly Tyr Gly Ser Gly Ser 
460 465 470 

gga ttc ggt tct gga tea ggg tat ggt gga ggc tec ate age aaa acc 1553 
Gly Phe Gly Ser Gly Ser Gly Tyr Gly Gly Gly Ser lie Ser Lys Thr 
475 480 485 

agt gtc acc acc gtc age agt aaa cgc tat taa ggagaagece gcccaaaccc 1606 
Ser Val Thr Thr Val Ser Ser Lys Arg Tyr 

490 495 

ccagccgaca cagtttccaa ccttccttac ctgeaactag atcccttctg aaccttctta 1666 

cgactcaaac catctatggt gctatatttt agecagacag ctgtcccctg ttaatgagga 1726 

gatgtggacg atgattttta aagtacaaaa taagttttag attgttctgt gtgttgatgg 1786 

tagttacccg tatcatgeat ctcctgtctg gtggtgtcac tgecatttta aatcatcaac 184 6 

ccaactacac taaaacgata ccaggaagaa tcgtgctcca agecactgaa tagtcttatt 1906 

tetgeactga tatgtacagg gaaagtgaga cacatagaaa ccactgtaac etaegtagta 1966 

ctatggtttc actggatcag gggtgtgcta tacaagttcc tgaatgtctt gtttgaatgt 2026 

tttgtgctgt tacaagctcc ctgctgtagt tttgetgact aatctgactt ttgtcatttt 2086 

gctatggctg tcagagttgg tttacctatt ttctataaaa tgtatatggc agtcagccaa 214 6 

taactgatga caattgettg tgggctacta atgtccagtt acctcacatt caagggagat 2206 

ctgttacagc aaaaaacagg cacaatggga tttatgtgga ccatccctcc ttaaccttgt 22 66 




t 
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gtactttccg tgttggaagt ggtgactgta ctgccttaca cattcccctg tattcaactg 2326 

gcttccagag catattttac atccccggtt ataaatggaa aatgcaagaa aactgaaaca 2386 

atgttcaacc agatttattt ggtattgatt gacgagacac caacttgaaa tttgaataca 2446 

ataaatctga gaccacaaaa aaaaaaaaaa aaaa 2480 

<210> 2 

<211> 498 

<212> PRT 

<213> Danio rerio 

<400> 2 

Met Ser Thr Arg Ser lie Ser Tyr Ser Ser Gly Gly Ser lie Arg Arg 
15 10 15 

Gly Tyr Thr Ser Gin Ser Ala Tyr Ala Val Pro Ala Gly Ser Thr Arg 
20 25 30 

Met Ser Ser Val Thr Ser Val Arg Arg Ser Gly Val Gly Ala Ser Pro 
35 40 45 

Gly Phe Gly Ala Gly Gly Ser Tyr Ser Phe Ser Ser Ser Ser Met Gly 
50 55 60 

Gly Gly Tyr Gly Ser Gly Leu Gly Gly Gly Leu Gly Gly Gly Met Gly 
65 70 75 80 

Phe Arg Cys Gly Leu Pro lie Thr Ala Val Thr Val Asn Gin Asn Leu 
85 90 95 

Leu Ala Pro Leu Asn Leu Glu lie Asp Pro Thr lie Gin Ala Val Arg 
100 105 110 

Thr Ser Glu Lys Glu Gin lie Lys Thr Phe Asn Asn Arg Phe Ala Phe 
115 120 125 

Leu lie Asp Lys Val Arg Phe Leu Glu Gin Gin Asn Lys Met Leu Glu 

130 135 140 

Thr Lys Trp Ser Leu Leu Gin Glu Gin Thr Thr Thr Arg Ser Asn lie 
145 150 155 160 

Asp Ala Met Phe Glu Ala Tyr lie Ser Asn Leu Arg Arg Gin Leu Asp 
165 170 175 

Gly Leu Gly Asn Glu Lys Met Lys Leu Glu Gly Glu Leu Lys Asn Met 
180 185 190 

Gin Gly Leu Val Glu Asp Phe Lys Asn Lys Tyr Glu Asp Glu lie Asn 
195 200 205 

Lys Arg Ala Ser Val Glu Asn Glu Phe Val Leu Leu Lys Lys Asp Val 
210 215 220 

Asp Ala Ala Tyr Met Asn Lys Val Glu Leu Glu Ala Lys Val Asp Ala 
225 230 235 240 
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Leu Gin Asp Glu lie Asn Phe Leu Arg Ala Val Tyr Glu Ala Glu Leu 
245 250 255 

Arg Glu Leu Gin Ser Gin lie Lys Asp Thr Ser Val Val Val Glu Met 
260 265 270 

Asp Asn Ser Arg Asn Leu Asp Met Asp Ser lie Val Ala Glu Val Arg 
275 280 285 

Ala Gin Tyr Glu Asp lie Ala Asn Arg Ser Arg Ala Glu Ala Glu Ser 
290 295 300 

Trp Tyr Lys Gin Lys Phe Glu Glu Met Gin Ser Thr Ala Gly Gin Tyr 
305 310 315 320 

Gly Asp Asp Leu Arg Ser Thr Lys Ala Glu He Ala Glu Leu Asn Arg 
325 330 335 

Met He Ala Arg Leu Gin Asn Glu He Asp Ala Val Lys Ala Gin Arg 
340 345 350 

Ala Asn Leu Glu Ala Gin He Ala Glu Ala Glu Glu Arg Gly Glu Leu 
355 360 365 

Ala Val Lys Asp Ala Lys Leu Arg He Arg Glu Leu Glu Glu Ala Leu 
370 375 380 

Gin Arg Ala Lys Gin Asp Met Ala Arg Gin Val Arg Glu Tyr Gin Glu 
385 390 395 400 

Leu Met Asn Val Lys Leu Ala Leu Asp He Glu He Ala Thr Tyr Arg 
405 410 415 

Lys Leu Leu Glu Gly Glu Glu Ser Arg Leu Ser Ser Gly Gly Ala Gin 
420 425 430 

Ala Thr He His Val Gin Gin Thr Ser Gly Gly Val Ser Ser Gly Tyr 
435 440 445 

Gly Gly Ser Gly Ser Gly Phe Gly Tyr Ser Ser Gly Phe Ser Ser Gly 
450 _ 455 460 

Gly Ser Gly Tyr Gly Ser Gly Ser Gly Phe Gly Ser Gly Ser Gly Tyr 
465 470 475 480 

Gly Gly Gly Ser He Ser Lys Thr Ser Val Thr Thr Val Ser Ser Lys 
485 490 495 

Arg Tyr 

<210> 3 

<211> 1589 

<212> DNA 

<213> Danio rerio 

<220> 
<221> CDS 

<222> (86) . . (1231) 
<220> 

<221> primer__bind 
<222> (6) . . (26) 
<223> MCK2 

<220> 



WO 00/49150 



PCT/SG99/00079 



-29- 

<221> primer_bind 
<222> (20) . . (38) 
<223> MCK1 

<220> 

<22%> polyA_signal 
<222> (1534) . . (1539) 

<400> 3 

cctatttcgg cttggtgaac aggatctgat cccaaggact gttaccactt ttgttgtctt 60 

ttgtgcagtg ttagaaaccg caatc atg cct ttc gga aac acc cac aac aac 112 

Met Pro Phe Gly Asn Thr His Asn Asn 
1 5 

ttc aag ctg aac tac tea gtt gat gag gag tat cca gac ctt age aag 160 

Phe Lys Leu Asn Tyr Ser Val Asp Glu Glu Tyr Pro Asp Leu Ser Lys 
10 15 20 25 

cac aac aac cac atg gec aag gtg ctg act aag gaa atg tat ggc aag 208 
His Asn Asn His Met Ala Lys Val Leu Thr Lys Glu Met Tyr Gly Lys 
30 35 40 

ctt agg gac aag cag acc cca cct gga ttc act gtg gat gat gtc ate 256 
Leu Arg Asp Lys Gin Thr Pro Pro Gly Phe Thr Val Asp Asp Val lie 
45 50 55 

cag act ggt gtt gac aat cca ggc cac ccc ttc ate atg acc gtc ggc 304 
Gin Thr Gly Val Asp Asn Pro Gly His Pro Phe lie Met Thr Val Gly 
60 65 70 

tgt gtt get ggt gat gag gag tec tac gat gtt ttc aag gac ctg ttc 352 
Cys Val Ala Gly Asp Glu Glu Ser Tyr Asp Val Phe Lys Asp Leu Phe 
75 80 85 

gac ccc gtc att tec gac cgt cac ggt gga tac aag gca act gac aag 4 00 
Asp Pro Val lie Ser Asp Arg His Gly Gly Tyr Lys Ala Thr Asp Lys 
90 95 100 105 

cac aag acc gac etc aac ttt gag aac ctg aag ggt ggt gat gac ctg 4 48 
His Lys Thr Asp Leu Asn Phe Glu Asn Leu Lys Gly Gly Asp Asp Leu 
110 115 120 

gac ccc aac tac ttc ctg age age cgt gtg cgt acc gga cgc age ate 4 96 
Asp Pro Asn Tyr Phe Leu Ser Ser Arg Val Arg Thr Gly Arg Ser lie 
125 130 135 

aag gga tac ccc ctg ccc ccc cac aac age cgt gga gag cgc aga get 54 4 
Lys Gly Tyr Pro Leu Pro Pro His Asn Ser Arg Gly Glu Arg Arg Ala 
140 145 150 

gtg gag aag ctg tct gtt gaa get ctg agt age ttg gat gga gag ttc 592 
Val Glu Lys Leu Ser Val Glu Ala Leu Ser Ser Leu Asp Gly Glu Phe 
155 160 165 

aag ggc aag tac tac ccc ctg aag tec atg act gat gac gag cag gag 64 0 
Lys Gly Lys Tyr Tyr Pro Leu Lys Ser Met Thr Asp Asp Glu Gin Glu 
170 175 180 185 

cag ctg ate get gac cac ttc etc ttt gac aaa ccc gtc tec ccc ctg 688 
Gin Leu lie Ala Asp His Phe Leu Phe Asp Lys Pro Val Ser Pro Leu 
190 195 200 

ctg ctg get get ggt atg gec cgt gac tgg ccc gat gec aga ggc att 7 36 
Leu Leu Ala Ala Gly Met Ala Arg Asp Trp Pro Asp Ala Arg Gly lie 
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205 210 215 

tgg cac aat gag aac aaa gcc ttc ctg gtc tgg gtg aaa cag gag gat 784 
Trp His Asn Glu Asn Lys Ala Phe Leu Val Trp Val Lys Gin Glu Asp 
220 225 230 

cac ctg cgt gtc att tec atg cag aag ggt ggc aac atg aag gaa gtg 832 
His Leu Arg Val lie Ser Met Gin Lys Gly Gly Asn Met Lys Glu Val 
235 240 245 

ttc aag cgc ttc tgc gtt ggt ctt cag agg att gag gaa att ttc aag 880 
Phe Lys Arg Phe Cys Val Gly Leu Gin Arg lie Glu Glu lie Phe Lys 
250 255 260 265 

aag cac aac cat ggg ttc atg tgg aac gag cat ctt ggt ttc gtc ctg 928 
Lys His Asn Has Gly Phe Met Trp Asn Glu His Leu Gly Phe Val Leu 
270 275 280 

acc tgc ccc tec aac ctg ggc aca ggc ctg cgc ggt gga gtc cac gtc 97 6 
Thr Cys Pro Ser Asn Leu Gly Thr Gly Leu Arg Gly Gly Val His Val 
285 290 295 

aag ctg ccc aag etc age aca cat gcc aag ttt gag gag ate ctg acc 1024 
Lys Leu Pro Lys Leu Ser Thr His Ala Lys Phe Glu Glu lie Leu Thr 
300 305 310 

aga ctg cgc ctg cag aag cgt ggc aca ggg ggt gtg gac acc get tec 1072 
Arg Leu Arg Leu Gin Lys Arg Gly Thr Gly Gly Val Asp Thr Ala Ser 
315 320 325 

gtt ggt gga gtg ttt gac att tec aac get gac cgt ate ggc tct tea 1120 
Val Gly Gly Val Phe Asp lie Ser Asn Ala Asp Arg lie Gly Ser Ser 
330 335 340 345 

gag gtt gag cag gtg cag tgt gtg gtt gat ggt gtc aag ctg atg gtg 1168 
Glu Val Glu Gin Val Gin Cys Val Val Asp Gly Val Lys Leu Met Val 
350 355 360 

gag atg gag aag aag ctg gga gaa ggc cag tec ate gac age atg ate 1216 
Glu Met Glu Lys Lys Leu Gly Glu Gly Gin Ser lie Asp Ser Met lie 
365 370 375 

cct gcc cag aag taa agegggagge ccttccattt ttttcttcgt ctttgtctgt 1271 
Pro Ala Gin Lys 
380 

ttttttacag tccaacagca aegsagagga aaactgctgc tcaaaaagac agtctcacct 1331 

ttgcacctgt cttctttcct ttttttccct tcttctctaa tttccatgtc atttcgecat 1391 

ctttttttcc actttgtttc ctattaagtc ggtaacatct tgggatcaga tacccggsgc 1451 

aggagtgagt gcttgttgct gaggcttcac ctcaatttca gccttggttg taaaaagtga 1511 

atcaatcaaa gttgtatttc aaaataaaaa tccccaataa aaaaaaaaaa aaaaaaaaaa 1571 

aaaaaaaaaa aaaaaaaa 1589 

<210> 4 

<211> 381 

<212> PRT 

<213> Danio rerio 



<400> 4 

Met Pro Phe Gly Asn Thr His Asn Asn Phe Lys Leu Asn Tyr Ser Val 
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lb 



Asp Glu Glu Tyr Pro Asp Leu Ser Lys His Asn Asn His Met Ala Lys 
20 25 30 

Val Xeu Thr Lys Glu Met Tyr Gly Lys Leu Arg Asp Lys Gin Thr Pro 
35 40 45 

Pro Gly Phe Thr Val Asp Asp Val He Gin Thr Gly Val Asp Asn Pro 
50 55 60 

Gly His Pro Phe He Met Thr Val Gly Cys Val Ala Gly Asp Glu Glu 
65 70 75 80 

Ser Tyr Asp Val Phe Lys Asp Leu Phe Asp Pro Val He Ser Asp Arq 
85 90 95 

His Gly Gly Tyr Lys Ala Thr Asp Lys His Lys Thr Asp Leu Asn Phe 
100 105 no 

Glu Asn Leu Lys Gly Gly Asp Asp Leu Asp Pro Asn Tyr Phe Leu Ser 
115 120 125 

Ser Arg Val Arg Thr Gly Arg Ser He Lys Gly Tyr Pro Leu Pro Pro 
130 135 140 

His Asn Ser Arg Gly Glu Arg Arg Ala Val Glu Lys Leu Ser Val Glu 
145 150 155 160 

Ala Leu Ser Ser Leu Asp Gly Glu Phe Lys Gly Lys Tyr Tyr Pro Leu 
165 170 175 

Lys Ser Met Thr Asp Asp Glu Gin Glu Gin Leu He Ala Asp His Phe 
180 185 190 

Leu Phe Asp Lys Pro Val Ser Pro Leu Leu Leu Ala Ala Gly Met Ala 
195 200 205 

Arg Asp Trp Pro Asp Ala Arg Gly He Trp His Asn Glu Asn Lys Ala 
210 215 220 

Phe Leu Val Trp Val Lys Gin Glu Asp His Leu Arg Val He Ser Met 
225 230 235 240 

Gin Lys Gly Gly Asn Met Lys Glu Val Phe Lys Arg Phe Cys Val Gly 



245 



250 



255 



Leu Gin Arg He Glu Glu lie Phe Lys Lys His Asn His Gly Phe Met 
260 265 270 

Trp Asn Glu His Leu Gly Phe Val Leu Thr Cys Pro Ser Asn Leu Glv 

2 ?5 280 285 

Thr Gly Leu Arg Gly Gly Val His Val Lys Leu Pro Lys Leu Ser Thr 
290 295 300 

His Ala Lys Phe Glu Glu He Leu Thr Arg Leu Arg Leu Gin Lys Ara 
305 310 315 320 

Gly Thr Gly Gly Val Asp Thr Ala Ser Val Gly Gly Val Phe Asp He 
325 330 335 

Ser Asn Ala Asp Arg He Gly Ser Ser Glu Val Glu Gin Val Gin Cys 
340 345 350 
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Val Val Asp Gly Val Lys Leu Met Val Glu Met Glu Lys Lys Leu Gly 
355 360 365 

Glu Gly Gin Ser lie Asp Ser Met lie Pro Ala Gin Lys 
370 375 380 

<210> 5 

<211> 1104 

<212> DNA 

<213> Danio rerio 

<220> 

<221> CDS 

<222> (75) . . (1034) 

<220> 

<221> primer_bind 
<222> (45) . . (64 ) 

<223> ARP2 q 
<220> 

<221> primer_bind 
<222> (87) . . (112) 
<223> ARP1 

<220> 

<221> polyA_signal 
<222> (1069) . . (1074) 

<400> 5 

cgcgtcccta ccgtgagatt ttacaacctt gtctttaaac cggctgttca ccgatccttg 60 

gaagcactgc aaag atg ccc agg gaa gac agg gcc acg tgg aag tec aac 110 
Met Pro Arg Glu Asp Arg Ala Thr Trp Lys Ser Asn 
15 10 

tat ttt ctg aaa ate ate caa ctg ctg gat gac ttc ccc aag tgt ttc 158 
Tyr Phe Leu Lys lie lie Gin Leu Leu Asp Asp Phe Pro Lys Cys Phe 
15 20 25 

ate gtg ggc gca gac aat gtc ggc tec aag cag atg cag acc ate cgt 206 
lie Val Gly Ala Asp Asn Val Gly Ser Lys Gin Met Gin Thr lie Arg 
30 35 40 

ctg tec ctg egg ggc aag gcc gtc gtg etc atg ggg aaa aac acc atg 254 
Leu Ser Leu Arg Gly Lys Ala Val Val Leu Met Gly Lys Asn Thr Met 
45 50 55 60 

atg agg aag gcc att cgt ggc cac ctg gaa aac aac cca get ctg gag 302 
Met Arg Lys Ala lie Arg Gly His Leu Glu Asn Asn Pro Ala Leu Glu 
65 70 75 

agg ctg ctt ccc cac ate cgc ggg aac gtg ggc ttc gtc ttc acc aag 350 
Arg Leu Leu Pro His lie Arg Gly Asn Val Gly Phe Val Phe Thr Lys 
80 85 90 

gag gat ctg act gag gtc cga gac ctg ctg ctg gca aac aaa gtg ccc 398 
Glu Asp Leu Thr Glu Val Arg Asp Leu Leu Leu Ala Asn Lys Val Pro 
95 100 105 

get get gcc cgt get ggt gcc ate gcc ccc tgt gag gtg act gtg ccg 44 6 
Ala Ala Ala Arg Ala Gly Ala lie Ala Pro Cys Glu Val Thr Val Pro 
110 115 120 



gcc cag aac acc ggg etc ggt cct gag aag acc tct ttc ttc cag get 



494 
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Ala Gin Asn Thr Gly Leu Gly Pro Glu Lys Thr Ser Phe Phe Gin Ala 
125 !3° 135 140 

ttg gga ate acc acc aag ate tec aga gga acc att gaa ate ttg agt 542 

Leu Gly He Thr Thr Lys He Ser Arg Gly Thr He Glu He Leu Ser 

"~ 145 150 155 

gac gtt cag ctt ate aaa cct gga gac aag gtg ggc gee age gag gee 590 

Asp Val Gin Leu He Lys Pro Gly Asp Lys Val Gly Ala Ser Glu Ala 
160 165 170 



acg ctg ctg aac atg ctg aac atg ctg aac ate teg ccc ttc tec tac 
Thr Leu Leu Asn Met Leu Asn Met Leu Asn lie Ser Pro Phe Ser Tyr 
175 180 185 



gcg gca get aca gag cag aaa tec get get cct gcg get aaa gag gag 
Ala Ala Ala Thr Glu Gin Lys Ser Ala Ala Pro Ala Ala Lys Glu Glu 
285 290 295 300 



acatctggaa acaaaaaaaa aaaaaaaaaa 



<210> 6 

<211> 319 

<212> PRT 

<213> Danio rerio 



<400> 6 

Met Pro Arg Glu Asp Arg Ala Thr Trp Lys Ser Asn Tyr Phe Leu Lys 
15 10 15 

He He Gin Leu Leu Asp Asp Phe Pro Lys Cys Phe lie Val Gly Ala 
20 25 30 



638 



ggg ctg ate ate cag cag gtg tat gat aac ggc agt gtc tac age ccc 686 
Gly Leu He He Gin Gin Val Tyr Asp Asn Gly Ser Val Tyr Ser Pro 
190 195 200 

gag gtg ctg gac ate act gag gac gee ctg cac aag agg ttc ctg aag 734 
Glu Val Leu Asp He Thr Glu Asp Ala Leu His Lys Arg Phe Leu Lys 
205 210 215 2 |o 

ggt gtg agg aac ate gec agt gtg tgt ctg cag ate ggc tac cca act 782 
Gly Val Arg Asn He Ala Ser Val Cys Leu Gin He Gly Tyr Pro Thr 
225 230 235 

ctt get tec ate cct cac act ate ate aat gga tac aag agg gtc ctq 830 
Leu Ala Ser He Pro His Thr He He Asn Gly Tyr Lys Arg Val Leu 
240 245 250 

get gtc act gtc gaa aca gac tac aca ttc ccc ttg get gag aag gtg 878 
Ala Val Thr Val Glu Thr Asp Tyr Thr Phe Pro Leu Ala Glu Lys Val 
255 260 265 

aag gee tac ctg get gat ccc acc get ttc get gtt gca gec cct gtt 926 
Lys Ala Tyr Leu Ala Asp Pro Thr Ala Phe Ala Val Ala Ala Pro Val 
27 0 275 280 



974 



gca ccc aag gag gat tct gag gag tct gat gaa gac atg ggc ttc ggc 1022 
Ala Pro Lys Glu Asp Ser Glu Glu Ser Asp Glu Asp Met Gly Phe Glv 
305 310 * 315 

ctg ttt gat taa accagacacc gaatatccat gtctgtttaa catcaataaa 1074 
Leu Phe Asp 

320 



1104 
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Asp Asn Val Gly Ser Lys Gin Met Gin Thr He Arg Leu Ser Leu Arg 
35 40 45 ' 

Gly Lys Ala Val Val Leu Met Gly Lys Asn Thr Met Met Arg Lys Ala 
-50 55 60 

He Arg Gly His Leu Glu Asn Asn Pro Ala Leu Glu Arg Leu Leu Pro 
65 70 75 80 

His lie Arg Gly Asn Val Gly Phe Val Phe Thr Lys Glu Asp Leu Thr 
85 90 95 

Glu Val Arg Asp Leu Leu Leu Ala Asn Lys Val Pro Ala Ala Ala Arg 
100 105 110 

Ala Gly Ala He Ala Pro Cys Glu Val Thr Val Pro Ala Gin Asn Thr 
115 120 125 

Gly Leu Gly Pro Glu Lys ' Thr Ser Phe Phe Gin Ala Leu Gly lie Thr 
130 135 140 

Thr Lys lie Ser Arg Gly Thr lie Glu He Leu Ser Asp Val Gin Leu 
145 150 155 160 

He Lys Pro Gly Asp Lys Val Gly Ala Ser Glu Ala Thr Leu Leu Asn 
165 170 175 

Met Leu Asn Met Leu Asn He Ser Pro Phe Ser Tyr Gly Leu He He 
180 185 190 

Gin Gin Val Tyr Asp Asn Gly Ser Val Tyr Ser Pro Glu Val Leu Asp 
195 200 205 

He Thr Glu Asp Ala Leu His Lys Arg Phe Leu Lys Gly Val Arg Asn 
210 215 220 

He Ala Ser Val Cys Leu Gin He Gly Tyr Pro Thr Leu Ala Ser He 
225 230 235 240 

Pro Hxs Thr He He Asn Gly Tyr Lys Arg Val Leu Ala Val Thr Val 
245 250 255 

Glu Thr Asp Tyr Thr Phe Pro Leu Ala Glu Lys Val Lys Ala Tyr Leu 
260 265 270 

Ala Asp Pro Thr Ala Phe Ala Val Ala Ala Pro Val Ala Ala Ala Thr 
275 280 285 

Glu Gin Lys Ser Ala Ala Pro Ala Ala Lys Glu Glu Ala Pro Lys Glu 
290 295 300 

Asp Ser Glu Glu Ser Asp Glu Asp Met Gly Phe Gly Leu Phe Asp 
305 310 315 



<210> 7 

<211> 2241 

<212> DNA 

<213> Damo rerio 

<220> 

<221> TATA^signal 
<222> (2103) . . (2108) 
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<220> 

<221> primer_bind 
<222> (2221) . - (2241) 
<223> CK2 

<22Q> 

<221> misc_feature 

<222> (2142) . . (2235) 

<223> Identical to the 5 T CK cDNA 

<400> 7 ^ n 
ccttcccttc tacttttgac gtccttttaa gagcttgtgc atgaaagcag atttggagct 60 

gattactcat ctcaaacacc catacaaagg gatgattgcc gtaccatgat ctcacacctt 120 

tcacacctgg tttatactat gatagttgta gacgattgcg taatgctatt aaatgcccat 180 

cagtgctggc tgtgacaccc aactgctgcc atttcgtgtt gacttgcacg agaaatgaga 24 0 

aattgtctga ctatgcaggg tgtctatgcg tgggaacatt tatcagtggt cattaaatac 300 

tatagtttac agttagacca aagtgtgctg tatttttgtg ttagcttagc tgcagttttt 360 

gtgtgtgaag taacaaatga caaatactca aactattgta attaagtagt ttttctcaga 420 

aattgtaatt tactaagtag tttaaaaatg tgtactttta ctttcccttg agtacatttt 480 

tagtgcagtg ttggtacttt tatttcactt ccttccttca acctgcagtc actactttat 540 

ttattcttgt ctatgtggat tagacaaatc agtcctgtga ttcctgtcca atcaaattgc 600 

acatagaagg taaatcacat cataatgaac taccttaaga catgggccat ttataattgc 660 

agcaaactgt ttgccagcat taaaagaaga tgtcaaaaat atttacacgc attaacccag 720 

agactgctta gatgcatgtc actgatgaga agatgatgga tgtttactgt atgatgaccg 780 

aaataacttt aaacgcacac aagacggcac aagacgtcaa catggcgtta ggttgacgtt 840 

gtaccccaac gcagtgggga cgttgcattt tgtttagaaa tgaaaattag gttgacgtca 900 

gaactcaacg tcaggtcgat gtcaatgttc aacatccaat ctaaaatcat atatcaatgt 960 

ctaatgatgt tacagcttga tgttatgcgg atgttacccc tatgacgtct atcagacgtt 1020 

ggattatggt tgccatacct gatgaataaa tgtcattatt tgacgttggt ttaagatgtt 1080 

ggttcgacat tggattttgg tcgctttcca acacaaccta aatccaccaa atattaactt 1140 

cctatgacat cgttattgga cgtcaaaata acaatatcct tagatgctgg ctagactttg 1200 

aatttaggtc accacaacct atatttaacc taatattaac atcttatgat gttgtgtgcc 1260 

tgctgggcaa taactaaatg cactacagaa tgttacgttt acacacatgt aaattacatg 1320 

taaatgcatc agcttttcac agcataatac tcactactta ctactcttga gtacttttaa 1380 

aaaagctact tttcactcat actttgagta atatttacaa ctgatacttt tactcgcact 1440 

acatttttag gcatgtattg atatttttac tatgattttt cagtactctt tccactactg 1500 

cagccctccc catacataat cgtatgttta cacatatggt ggagtttaga gccataatct 1560 

acattagctt tgttagccgc tagcattact gtgcagaatt gtgtgtgtgc acattttcca 1620 



t 
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atatcaatac agaaggaaac tgtgttccct gttcccttgt aaatctcaac aatgcaactg 1680 

ttcagctcag ggggaaaaat gccctgccag atccaaacgg ctggcaaaag tgaatggaaa 1740 

aaagcctttc attaatgtga aagttgctgc gcgccccacc cagataaaaa gagcagaggt 1800 

taacatgctc tctacggctg tccagccaac cagatactga ggcagaaaca cacccgctgg 1860 

cagatggtga gagctacact gtcttttcca gagtttctac tggaatgcct gtcctcaagt 1920 

ctcaagcctc tccttgcatt ctctcattcc acctggggca aagccccagg ctgggtgtga 1980 

caacatttat cttaccactt tctctctgta cctgtctaac aggtagggtg tgtgtgagag 2040 

tgcgtatgtg tgcaagtgcg tgtgtgtgtg agagcagtca gctccaccct ctcaagagtg 2100 

tgtataaaat tggtcagcca gctgctgaga gacacgcaga gggactttga ctctcctttg 2160 

tgagcaacct cctccactca ctcctctctc agagagcact ctcgtacctc cttctcagca 2220 
actcaaagac acaggatccg g 224 1 

<210> 8 

<211> 1456 

<212> DNA 

<213> Danio rerio 

<220> 

<221> TATA_signal 
<222> (1389) . - (1394) 

<220> 

<221> primer^bind 

<222> (1433) . . (1456) 

<223> MCK2 

<220> 

<221> misc_f eature 
<222> (1428) . . (1453) 

<223> Identical to the 5' MCK cDNA 
<400> 8 

gaattgcaaa gtcagagtaa taaaatgaaa ccaaaaaaca tttttaaata tacttgtctc 60 
tgtggcttaa tcttggctga tgtgtgtgtg tgtgtgtgtg tacttgacag ctgctagtga 120 
gcatgtgcac catgacaggc ctgttattca cacttggtgc catgttggag actgttcggc 180 
cagctatagt tttcttcaca gagtcctggg tcacctaatg tcacaaggaa gaaacatgtt 240 
acatgttaaa atgtgacatt caaattgtag tgcattactt aacgaaacgc attacacaag 300 
ttacagctta aaagattgct agacagaaaa accagggagg ggttttccca taatatccag 360 
tgagactcta ggagcgggaa cactaacagg cctccctgag tgagaacatt gcatgtgcgc 420 
gtgacagaaa accagagatg gaaatacctt cttttgaatt gcataattgc ttaaaagaag 480 
acacaacagg gatagttcac ccaaaaaaca gaccattctt tttttctgtt gaacaaaaat 540 
taagatattt tgaagaatgc ttaccgaata acttccatat ttggaaacta attacagtga 600 
aagtcaatgg gtcttccagc attttttcaa tataccttac tttgagttca aaagaaaaac 660 
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acatctcaaa 


taggtttgag 


gttgaataaa 


catttttcat 


tttggggtgg 


actatcccta 


720 


attatttgac 


acttaagatt 


tatagtaaat 


cattttatag 


actttctccc 


cttattaaac 


780 


atggttgaat 


ttatcttcat 


gtttatgtct 


gggttgtgct 


tttttgaaaa 


gatttccctg 


840 


tcaaatgttt 


ttgtgtatgg 


ttggcgcaca 


atagactgaa 


ctggcctatc 


acacagactt 


900 


tcataacaac 


tccagttgat 


gccctttcac 


cctcagtgta 


taaatatggc 


gtctgacatg 


960 


agcagattaa 


acacgacact 


gcaacaactt 


tacctgtaaa 


aatacaaatt 


gagtttgcac 


1020 


ccagaatcat 


gtggtgaacg 


aagcctacca 


agagattttt 


gaaagccatc 


ggcctgacac 


1080 


gcgcacttct 


gatatctgtg 


gtatgtttgg 


caaaagtgct 


gctcagcctt 


tttagcatgg 


1140 


cagatcctcc 


acatcccatc 


acccctcctt 


caacctattc 


cctcctggaa 


agctatgtat 


1200 


ggggcgggaa 


gtgtaaatgg 


atatgggaag 


gaaggggggc 


accacccaca 


gctgccacct 


1260 


catctaggat 


gcctggggcc 


taaattgaag 


cctttcttac 


actaaacagg 


gcataagaga 


1320 


ccagcgccag 


ccaatcataa 


ttcagtgagc 


tctaaaatgg 


gccagccaat 


ggctgcaggg 


1380 


gctagaggta 


tatatatcca 


aatcaaactc 


ttcttgcttg 


ggtgacccct 


atttcggctt 


1440 


ggtgaacagg 


atccgg 










1456 



<210> 9 

<211> 2205 

<212> DNA 

<213> Danio rerio 

<220> 

<221> primer_bind 
<222> (2179) . . (2205) 
<223> ARP2 

<220> 

<221> misc_feature 

<222> (2153) - . (2199) 

<223> Identical to the 5' ARP cDNA 
<220> 

<221> intron 

<222> (792) . . (2152) 

<220> 

<221> misc_f eature 
<222> (775) . . (791) 

<223> Identical to the 5' ARP cDNA 
<400> 9 

atctgtatta agaaacactt aaaatatata tgcgttacga attaaaaaca aaacacgatc 60 
attttaattt gtgttgtata attttacatt ttgtaagtat tatttttata aaaaatatat 120 
agaaataata caaatttgtt tacagtattc ttagttattg caataaacga attttatata 180 
gaaagagaaa gagttttatt ataagatgtt caatttaaaa aatggcagaa aatagaaaaa 240 
tgattgtcaa gatgataaaa gtcagtttag acaaaaaaat aagatgaaaa acatcaaaat 300 
agataataaa gtgacttttt tgggcggacc aaatttccct attaatggtc aattcattaa 360 
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ggtattgega 


yuyL.ciyelL.l-L 




ttagaaggga 






aattatccat 


tcaaccctca 


cagt aatgt a 


aaaatattat 


tatattgacc 


gcagct gt at 


cctttctaag 


cagacaegtg 


cattgaccaa 


t cagcgcaca 


atgatcgact 


gat actaata 


ttgtgccgct 


cctaccgtga 


ggtaaggctg 


acgccgctct. 


taaca teat a 


agaggt cacg 


agaaggtct a 


tatgcgttt a 


aagct tgtgt 


aatgatt tt t 


caggect cgt 


gegecat gtg 


t gaegegacg 


tccgattctg 


atattaatca 


tatttatgeg 


tt a cat get a 


gt acttgtgc 


tagteggteg 


ttttaaaacc 


agt t act etc 


attttagtga 


CLddLLaLdt 


aiLtaclT-tguyt 


LLatgtLLLd 


LtLdLdLtdy 


tat gt acaat 


ttggcataaa 


gtaaagttaa 


t ct taaaggg 


gt aaaggct c 


tcaaatctta 


t gagttt ctt 


aa tgaacatg 


accataatac 


aaa t acagga 


aaaat a tact 


L L O ci cl ci cl Ld L 


L-ciL-dtJciciy l- 


/T 4— -4- -4— —j 4— /_ rt — * 

yttLaatyya 


uvji ct L d ci d L L, 




4— fr ff ff 4— ft — \ - 4- 

T_gggT_gagcL\ 


y^dy Ly y l\ l-ca 


^ a /"t f "4— "t - rr 

LCdatJCLtyt 


l LCtgdayyy 


UuCt L. L. CtCl a L> Cl 


LaL-L Ly dciLd 


dLjLLdaLCad 


y L,ciy y Ly uy l 


tgatgeaaga 


tagagct aaa 


gtgacccctg 


cctcaagcca 


teacaaatge 


ttatggacag 


gctgttgcag 


tgcttgttcg 


cttctatccc 


tgtctgtctg 


catctcatga 


atagcagtaa 


atcaaataca 


atagtgetet 


taggcaacca 


aattacctgg 


aaacagttta 


aagatgeaca 


caaggcaggt 


gtaaaagtat 


accttgt ctt 


taaacegget 


gttcaccgat 


<210> 10 
<211> 24 
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tgaatttaga 


tgcacagtga 


ttttggttct 


4 20 


tacatctgeg 


gecgaaagt t 


aacgggaact 


4 80 


ta ttt gtatt 


t tattacccc 


aacegtaaac 


540 


ttattgttt t 


atagegtcac 


agaatgat gc 


600 


t gcgactgta 


caaatacgea 


ctgaccgtga 


660 


gataegcat t 


ttccgcgcga 


ttctgattgg 


720 


tcctttcgcg 


gcctctttct 


ttcacgcgtc 


780 


tgtggcggtt 


tcttaaaatg 


tgttaataaa 


840 


cgtgtgttta 


atatcagegg 


eggttattat 


900 


acagt aaaag 


ttagcactag 


cctgttagca 


960 


ttttaatagc 


atcttatttg 


attttgatga 


1020 


taaaatgtgt 


gatgggtctg 


ctagtggaca 


1080 


atccacatt g 


agatgttgcg 


etatttgeca 


1140 


aatattctta 


agecactaag 


ttaaaatttg 


1200 


t t tgagtcat 


cataccaggt 


aatagtttta 


12 60 


ctgect tegg 


ttttgattga 


catctacttt 


1320 


acccaaaaga 


caattcaccg 


tcaagtgt tt 


138 0 


j-_4~ _^ +- n 4—4—4- 4— 

gt at gt ttt g 


gagaaaactg 


gaaaccaact 


14 4 0 


a t agaagt eg 


atggttacag 


gttttctgea 


1500 


aggaactcaa 


gtgattt gaa 


aagttaaggg 


1 5 60 


gt ct ct aaac 


at ttgat tta 


gacacctcag 


1 62 0 


ccagtgtcct 


acagatttta 


gctccaaccc 


1 680 


ggtct tact a 


ggt at gtttg 


aaacat ccag 


1740 


ccctgcaggg 


acaatggccc 


aacaggattg 


1800 


attatggtat 


taagaaatgt 


gcaggttcag 


1860 


tcgttcccac 


tgcacaaatg 


aacatgattc 


1920 


ettgeaggga 


cgctggtctc 


agacacgttt 


1980 


gattatcttt 


aaatatttga 


aagcttataa 


2040 


caaacagtaa 


ttcatatttt 


gtcatttaat 


2100 


tgcttgtgtt 


tgtaatcctc 


agattttaca 


2160 


ccttggaagg 


gatcc 




2205 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cytokeratin 
— gene specific primer 

<400> 10 

cgctggagta agagatagac ctgg 

<210> 11 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cytokeratin 
gene specific primer 

<220> 

<221> misc_f eature 
<222> (1) . . (6) 

<223> Introduced for restriction site 
<220> 

<221> misc_feature 
<222> (3).. (8) 
<223> BamHI site 

<400> 11 

ccggatcctg tgtctttgag ttgctg 

<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Muscle 
creatine kinase gene specific primer 

<220> 

<221> misc^feature 
<222> (3) . . (8) 
<223> BamHI site 

<400> 12 

ccggatcctt gggatcagat cctg 

<210> 13 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Muscle 
creatine kinase gene specific primer 

<220> 

<221> mis c_f eature 
<222> (1) . . (3) 

<223> Introduced for restriction site 
<220> 

<221> raise feature 



# 
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<222> (3) . , (8) 
<223> BamHI site 

<400> 13 

ccggatcctg ttcaccaagc cgaa 24 

<210> 14 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Acidic 
ribosomal protein PO gene specific primer 

<40Q> 14 

tagttggact tccacgtgcc ctgtc 25 




<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Acidic 
ribosomal protein PO gene specific primer 

<220> 

<221> mis cofeature 
<222> (1) . . (7) 

<223> Introduced for restriction site 
<220> 

<221> luisc^feature 
<222> (1) . . (6) 
<223> BamHI site 

<400> 15 

ggatcccttc caaggatcgg tgaaca 26 

<210> 16 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Oligonucleotide for linker used in linker-mediated 
PCR 

<400> 16 

gttcatcttt acaagctagc gctgaacaat gctgtggaca agcttgaatt c 51 



<210> 17 
<211> 10 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Oligonucleotide for linker used in linker-mediated 
PCR 




WO 00/49150 



PCT/SG99/00079 



-41 - 



<220> 



<223> n is a dideoxycytidine 



<400^ 17 
gaattcaagn 



10 



<210> 18 
<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker 
specific primer 

<400> 18 

gttcatcttt acaagctagc g 21 



<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker 
specific primer 



<210> 20 

<211> 1392 

<212> DNA 

<213> Danio rerio 

<220> 

<221> CDS 

<222> (42) . . (551) 

<220> 

<221> prime r_bind 
<222> (6) . . (28) 
<223> M2 

<220> 

<221> primer_Jbind 
<222> (23) . . (45) 
<223> Ml 

<220> 

<221> polyA_signal 
<222> (797) . . (802) 

<220> 

<221> polyA_signal 
<222> (1351) . . (1357) 



<400> 19 

tcctgaacaa tgctgtggac 



20 



<400> 20 

ctcttcttga tcttcttaga cttcacacat accgtctcga c atg gca ccc aag aag 56 
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Met Ala Pro Lys Lys 
1 5 



gcc aag agg agg gca gca gga gga gag ggt tec tec aac gtc ttc tec 
Ala Lys Arg Arg Ala Ala Gly Gly Glu Gly Ser Ser Asn Val Phe Ser 
- 10 15 20 

atg ttt gag cag age cag att cag gag tac aaa gag get ttc aca ate 
Met Phe Glu Gin Ser Gin He Gin Glu Tyr Lys Glu Ala Phe Thr He 
25 30 35 

att gac cag aac aga gac ggt ate ate age aaa gac gac ctt agg gac 
He Asp Gin Asn Arg Asp Gly He He Ser Lys Asp Asp Leu Arg Asp 
40 45 50 

gtg ttg gcc tea atg ggc cag ctg aat gtg aag aat gag gag ctg gag 

Val Leu Ala Ser Met Gly Gin Leu Asn Val Lys Asn Glu Glu Leu Glu 
55 60 65 

gcc atg ate aag gaa gcc age ggc cca ate aac ttc ace gtt ttc etc 
Ala Met He Lys Glu Ala Ser Gly Pro He Asn Phe Thr Val Phe Leu 
70 75 80 85 

acc atg ttc gga gag aag ttg aag ggt get gac ccc gaa gac gtc ate 
Thr Met Phe Gly Glu Lys Leu Lys Gly Ala Asp Pro Glu Asp Val He 
90 95 100 

gtg tct gcc ttc aag gtg ctg gac cct gag ggc act gga tec ate aag 

Val Ser Ala Phe Lys Val Leu Asp Pro Glu Gly Thr Gly Ser He Lys 
105 HO 115 

aag gaa ttc ctt gag gag ctt ttg acc act cag tgc gac agg ttc acc 
Lys Glu Phe Leu Glu Glu Leu Leu Thr Thr Gin Cys Asp Arg Phe Thr 
120 125 130 

gca gag gag atg aag aat ctg tgg gcc gcc ttc ccc cca gat gtg get 
Ala Glu Glu Met Lys Asn Leu Trp Ala Ala Phe Pro Pro Asp Val Ala 
135 140 145 

ggc aat gtt gac tac aag aac ate tgc tac gtc ate aca cac gga gag 
Gly Asn Val Asp Tyr Lys Asn He Cys Tyr Val He Thr His Gly Glu 
150 155 160 165 



104 



152 



200 



248 



296 



344 



392 



440 



488 



536 



gag aag gag gag taa acaaccttgg aatcaagaaa acgaagagaa gaacatgeat 5 91 
Glu Lys Glu Glu 

170 

cctcacagct taatctccag tctgttgtct ggccttctct aacttttgtt tttccttcct 651 
ccctttcttg ctttctacca tcgttgttac tccaagcact tacactctcc atcttaccaa 711 
agacttgtct cgctgggact gaattgggag ggtggagagg aacacgacca cagtgtctgt 771 
cgagtgggga catgggattg ttttcaataa aatgaacatc atttctgtat ctctcacatt 831 
ctctctttct ctctgtttct cactcattac ccacaacccc tctctttcat ttcagtcaag 891 
cttgcatgta agtegctget tettctgetg cagtcttagg agttgaaacg aaggcatcta 951 
tagtttgggg ctgaaacatc tctctagatc aatgtggaag agtgctcact ctgaggggga 1011 
aagaagcacg atggagtgat ctcactctat aatagaggaa ccagtcatca ttctcatttc 1071 



t 
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ctcctctggt ggttgactaa aaagagaaag agaaaatgag ggttttgtgc tgagtgagtt 1131 

tagcctccta aaagcgatgc cgagctcatc acagagggag tgagagggac agaccatcct 1191 

aggaagagag gagagcaggg actgaaagaa aacataacct cttcactccc cctctcccct 1251 

cctcttctct atttctctgt ccatcttttc ttttttcttt tttctttttt gctttctgca 1311 

tctgggcctg ctttgctctg ccaaacctct cctgtaacca ataaaaagac acaaactgtg 1371 

aataaaaaaa aaaaaaaaaa a 1392 

<210> 21 

<211> 169 

<212> PRT 

<213> Danio rerio 

<400> 21 

Met Ala Pro Lys Lys Ala Lys Arg Arg Ala Ala Gly Gly Glu Gly Ser 
15 10 15 

Ser Asn Val Phe Ser Met Phe Glu Gin Ser Gin lie Gin Glu Tyr Lys 
20 25 30 

Glu Ala Phe Thr lie lie Asp Gin Asn Arg Asp Gly lie lie Ser Lys 
35 40 45 

Asp Asp Leu Arg Asp Val Leu Ala Ser Met Gly Gin Leu Asn Val Lys 
50 55 60 

Asn Glu Glu Leu Glu Ala Met lie Lys Glu Ala Ser Gly Pro lie Asn 
65 70 75 80 

Phe Thr Val Phe Leu Thr Met Phe Gly Glu Lys Leu Lys Gly Ala Asp 
85 90 95 

Pro Glu Asp Val lie Val Ser Ala Phe Lys Val Leu Asp Pro Glu Gly 
100 105 110 

Thr Gly Ser lie Lys Lys Glu Phe Leu Glu Glu Leu Leu Thr Thr Gin 
115 120 125 

Cys Asp Arg Phe Thr Ala Glu Glu Met Lys Asn Leu Trp Ala Ala Phe 
130 135 140 

Pro Pro Asp Val Ala Gly Asn Val Asp Tyr Lys Asn lie Cys Tyr Val 
145 150 155 160 

lie Thr His Gly Glu Glu Lys Glu Glu 
165 

<210> 22 

<211> 2054 

<212> DNA 

<213> Danio rerio 

<220> 

<221> TATA_signal 
<222> (1983) . . (1989) 

<220> 

<221> enhancer 
<222> (142) . . (148) 
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<223> E-box, canntg 
<220> 

<221> enhancer 
<222> (452) . . (457) 
<223> E-box, canntg 

<220> 

<221> enhancer 
<222> (1095) . . (1100) 
<223> E-box, canntg 

<220> 

<221> enhancer 
<222> (1278) . . (1283) 
<223> E-box, canntg 

<220> 

<221> enhancer 
<222> (1362) . . (1367) 
<223> E-box, canntg 

<220> 

<221> enhancer 
<222> (1385) . . (1390) 
<223> E-box, canntg 

<220> 

<221> enhancer 
<222> (523) . . (532) 

<223> Potential MEF2 binding site, yta(w)4tar 
<220> 

<221> enhancer 
<222> (606) . - (615) 

<223> Potential MEF2 binding site, yta(w)4tar 
<220> 

<221> enhancer 
<222> (697) . . (706) 

<223> Potential MEF2 binding site, yta(w)4tar 
<220> 

<221> enhancer 
<222> (1490) . . (1499) 

<223> Potential MEF2 binding site, yta(w)4tar 
<220> 

<221> enhancer 
<222> (1640) . . (1649) 

<223> Potential MEF2 binding site, yta (w) 4tar 
<220> 

<221> enhancer 
<222> (1956) . . (1965) 

<223> Potential MEF2 binding site, yta(w)4tar 
<220> 

<223> Transcription start site at residue 2012 
<220> 

<221> primer_bind 
<222> (2032) . . (2054) 
<223> M2 



# 
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-45 

<220> 

<221> misc_dif f erence 
<222> (2027) . . (2054 ) 

<223> Identical to the 5' MLC2 f cDNA 
<40O> 22 



tgcatgcctg 


gcaggtccac 


tctagaggac 


tactagtcat 


atgcgattct 


gaacaatgct 


60 


gtggacagct 


tgaattcaat 


t cgccacaga 


ggaatgagcc 


accaactcat 


ccagtgtatt 


120 


ttttatgcag 


cggatgccca 


tccagttgca 


accctacact 


gggaaacacc 


caaatctgtc 


180 


ttttaaatta 


atatttttaa 


taggaagcta 


tgttatattt 


gtgcatatac 


attagattag 


240 


tcagtacact 


ctcagaaata 


aatggtccgc 


aagctgtcac 


tgcggtggta 


ccttttcaaa 


300 


aggtacatat 


ttgtacttaa 


agggtccata 


ttgatacctc 


aaaagtatat 


attagtgcct 


360 


aaaaatttta 


agagaaacac 


ttttgtactt 


tttaggtact 


aatatatacc 


cttgaggttt 


420 


tattatggac 


ctttaaggta 


caaattttta 


tcatttggaa 


aggtaccacc 


ccagtgacag 


480 


ctcacgccct 


ttatttctga 


gagtgaagct 


aaatctggag 


cttatttaac 


aaaataactt 


540 


atgataacgg 


tccaaaaact 


actacaccca 


tatttatatg 


ttattgaaaa 


atattaaata 


600 


caaatttata 


aacaggggaa 


aatcaagaga 


agcaaaacaa 


tggaaaaaaa 


ttagttcaaa 


660 


atttgtaggt 


tgtaattttt 


ttgttgcaat 


atttagcttt 


atttaaattg 


ttttatcttt 


720 


caattttgtg 


actaaaatat 


tattttaata 


aatatagctg 


tttaataaat 


ctgttttgtt 


780 


taaatgcacc 


aaaatacatt 


gcctatattc 


actgagagat 


ggagaaaaat 


attcattttc 


840 


aaaatgggct 


gtactcaatt 


actctaagca 


ctgtaattat 


ctgtttttct 


aggtactgta 


900 


aaataatgct 


aatcttaaac 


aattttggaa 


caagcaggag 


caaaacaagc 


cgacagactc 


960 


acaggatatc 


aaattagtct 


aaagaaagaa 


gggaatgcac 


aacaaactca 


aggggggcaa 


1020 


aacagtgact 


gatgaattag 


acaagaaaaa 


gagagcaagg 


agcgctcaag 


attgtttagc 


1080 


tattttggtc 


accacagctg 


ttccttatgc 


ctgccttccc 


aaaaaaaaac 


tgtcttaagc 


1140 


ctcaaatttc 


tcttcatgag 


ggtccaacat 


caaccactca 


gagggctgta 


gtgtgctgac 


1200 


catctaaaaa 


ctgggaaaaa 


ggggtaatta 


cgtgcttgtc 


cacagggcag 


cttcccacaa 


1260 


atggcacctc 


acagtcactg 


aagtgaccgg 


gtgaggtcat 


aggtcgatcg 


gcagagagag 


1320 


agagagagag 


agagagagag 


agtgctgaat 


atgggcatgg 


ccatgtggac 


gggtgtggtg 


1380 


ggggcacttg 


aaccgaaatc 


ttacagcatc 


actatactag 


gaaaagcatt 


aaaacctatt 


1440 


ttcgattagg 


gctgatttga 


aataagggtt 


aagacaccag 


aacgtcctct 


tatatatcag 


1500 


ccagggcgtg 


aacaaatatg 


aacaacataa 


tcattggctc 


aaaaatctct 


ggattgaaat 


1560 


ccatcaggat 


ctat cactgc 


aaccctcccc 


atccaacaag 


taatcctgca 


agataagcca 


1620 


gtattgatct 


gctgctaatc 


taactttagg 


ggagggagtt 


gcgtcccctt 


tagactcagt 


1680 


ggctacagct 


cattcatttc 


aaattgagtt 


atgtgattgt 


atgaagctca 


aacagtccct 


1740 


tacgtcccca 


tgtccttatt 


agacaacgcg 


agacatgcag 


gccgctgcca 


tcagtatcag 


1800 



PCT/SG99/00079 

4 
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attcatccca ttccaagact ccaatagcta tttctgagca ctgtaagatg atagtacatc 1860 

ccagccggtg tccctccatc actttccccc tacctcatag tttttcctct ttctctctcg 1920 

gtctxEfCtatt tcccaaacct cacttaaggt tgggtctata attagcaagg ggccttcgtc 1980 

agtatataag cccctcaagt acaggacact acgcggcttc agacttctct tcttgatctt 204 0 



<210> 23 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MLC2F gene 
specific primer Ml 

<400> 23 

ccatgtcgag acggtatgtg tga 23 



<210> 24 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MLC2F gene 
specific primer M2 



cttagacttc 



acac 



2054 



<400> 24 

gtgtgaagtc taagaagatc aag 



23 
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CLAIMS 

1. A zebrafish cytokeratin gene promoter which is capable of directing a 
structural gene to be predominantly expressed in skin epithelia when it is inserted in front 
of the structural gene and introduced into fish embryos. 



2. A zebrafish muscle creatine kinase gene promoter which is capable of 
5 directing a structural gene to be specifically expressed in muscles when it is inserted in 

front of the structural gene and introduced into fish embryos. 

3. A zebrafish fast skeletal muscle isoform of myosin light chain 2 gene 
promoter which is capable of directing a structural gene to be predominantly expressed in 
skeletal muscles when it is inserted in front of the structural gene and introduced into fish 

10 embryos. 

4. A zebrafish acidic ribosomal protein PO gene promoter which is capable of 
directing a structural gene to be expressed ubiquitously in all tissues when it is inserted in 
front of the structural gene and introduced into fish embryos. 

5. A recombinant DNA molecule comprising a structural gene and the 
15 promoter of claim 1, 2, 3 or 4 arranged upstream of said structural gene. 

6. A chimeric gene comprising the promoter of claim 1, 2, 3 or 4, operatively 
linked to DNA encoding a protein selected from the group consisting of GFP, modified 
GFP, EGFP, BFP, EBFP, YFP, EYFP, CFP, ECFP, luciferase, B-galactosidase, 
chloramphenicol acetyltransferase, and growth hormone. 

20 7. A transgenic fish comprising a chimeric gene comprising the promoter of 

claim 1,2, 3 or 4. 

8. The transgenic fish of claim 7, which contains said promoter in germ cells 
and/or in somatic cells and which is capable of breeding with either a said transgenic fish 
or a non-transgenic fish to produce viable and fertile transgenic progeny. 

25 9. The transgenic fish of claim 7, and progeny of said fish that emits green 

fluorescence when the whole fish is exposed to a blue or ultraviolet light. 

10. A transgenic fish comprising a DNA that encodes a fluorescent protein 
under control of a promoter that causes said DNA (1) to be expressed in predominately 
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skin epithelia, (2) to be specifically expressed in muscles, (3) to be predominantly 
expressed in skeletal muscles, or (4) to be expressed ubiquitously in all tissues. 

— 11. The transgenic fish of claim 8, wherein said fluorescent protein is expressed 
a level sufficient that said fish fluoresces upon exposure to sunlight or daylight. 

5 12. The transgenic fish of claim 11, wherein said high expression is induced by 

exposure of said fish to a steroid compound or to a heavy metal. 

13. The transgenic fish of claim 10, wherein said promoter is a promoter which 
naturally occurs in the genome of a fish of the same species as the transgenic fish. 

14. A recombinant DNA vector comprising a promoter DNA that hybridizes 
10 under stringent conditions to a polynucleotide of any one of SEQ ID NOS:7 s 8, 9, or 22, 

operatively linked to a structural gene encoding a fluorescent or chemiluminescent protein. 

15. A cell transformed with the vector of claim 14. 

16. A transgenic fish comprising a chimeric gene in turn comprising a promoter 
DNA that hybridizes under stringent conditions to a polynucleotide of any one of SEQ ID 

15 NOS:7, 8, 9, or 22, operatively linked to a structural gene encoding a fluorescent or a 
chemiluminescent protein. 

17. A method for sensing a steroid hormone or a steroid hormone derivative in a 
water sample comprising: 

(a) contacting a fish expressing a fluorescent or chemiluminescent 
20 protein under control of an estrogen- or other steroid hormone-inducible promoter with a 

sample of water; and 

(b) measuring the amount of fluorescent or chemiluminescent light from 

said fish. 

18. A method for sensing heavy metals, such as zinc, copper, cadmium, mecury 
25 etc., in a water sample comprising: 

(a) contacting a fish expressing a fluorescent or chemiluminescent 
protein under control of a heavy metal-inducible promoter with a sample of water; and 

(b) measuring the amount of fluorescent or chemiluminescent light from 

said fish. 
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Abstract of the Disclosure 

Four zebrafish gene promoters, which are skin specific, muscle specific, skeletal muscle specific 
and ubiquitously expressed, respectively, were isolated and ligated to the 5' end of the EGFP gene. 
When the resulting chimeric gene constructs were introduced into zebrafish, the transgenic zebrafish 
emit green fluorescence under a blue light or ultra-violet light according to the specificity of the 
promoters used. Thus, new varieties of ornamental fish of different fluorescence patters, e.g., skin 
fluorescence, muscle fluorescence, skeletal muscle-specific and/or ubiquitous fluorescence, are 
developed. 
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and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and 
that such willful false statements may jeopardize the validity of the application or any patent issued thereon. 
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^Zhiyjaan GONG 
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Residence (City, State & Country) ~ ? 
S^apore^Singapore 


CITIZENSHIP 

Canada ' S G- X 


MAILING ADDRESS (Complete Street Address including City, State & Country) 
113 Clementi Rd. #07-01, Singapore 129793 


GIVEN NAME/FAMILY NAME 
Jiangyan HE 


INVENTOR'S SIGNATURE 


DATE* 


Residence (City, State & Country) 
Cleveland, Ohio, USA 


CITIZENSHIP 
P.R. China 


MAILING ADDRESS (Complete Street Address including City, State & Country) 
2636 North Moreland Bldv., Apt. 8, Cleveland, OH 44120, USA 


GIVEN NAME/FAMILY NAME 
Bensheng JU 


INVENTOR'S SIGNATURE 


DATE* 


Residence (City, State & Country) 
Singapore, Singapore 


CITIZENSHIP 

P.R. China SG-X 


MAILING ADDRESS (Complete Street Address including City, State & Country) 
Blk 403, Clementi Avenue 1, #06-206, Singapore 120403 , i J 
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Xoong Jin IANj 


INVErfr&R'S SIGNATURE 
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Residence (City, State & Country) / / 
Singapore, Singapore 
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Singapore 5 G~ 


MAILING ADDRESS (Complete Street Address including City, State & Country) 
7 Second Avenue, Singapore 266657 


GIVEN NAME/FAMILY NAME 
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INVENTOR'S SIGNATURE 


DATE* 


Residence (City, State & Country) 
Chicago, Illinois, USA 


CITIZENSHIP 
P.R. China 


MAILING ADDRESS (Complete Street Address including City, State & Country) 
924 East 57th Street R122, Chicago, IL 60637, USA 
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Attorney Docket No. 1781-0163P 

I hereby appoint the practitioners at CUSTOMER NO. 2292 as my attorneys or agents to prosecute 
this application and/or an international application based on this application and to transact all 
business in the United States Patent and Trademark Office connected therewith and in connection with 
the resulting patent based on instructions received from the entity who first sent the application papers 
to the practitioners, unless the inventor(s) or assignee provides said practitioners with a written notice to 
the contrary: 

Send Correspondence to: 

BIRCH, STEWART, KOLASCH & BIRCH, LLP or CUSTOMER NO. 2292 

P.O. Box 747 • Falls Church, Virginia 22040-0747 
Telephone: (703) 205-8000 • Facsimile: (703) 205-8050 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information 
and belief are believed to be true; and further that these statements were made with the knowledge that willful false statements 
and the like so made are punishable by fine or imprisonment or both, under Section 1001 of Title 18 of the United States Code and 
that such willful false statements may jeopardize the validity of the application or any patent issued thereon. 
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GIVEN NAME / FAMILY NAME 
Zhiyuan GONG 
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DATE* 


Residence (City, State & Country) 
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MAILING ADDRESS (Complete Street Address including City, State & Country) 
113 Clementi Rd. #07-01, Singapore 129793 
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Residence (City, State & Country) 
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Residence (City, State & Country) 
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MAILING ADDRESS (Complete Street Address including City, State & Country) 
7 Second Avenue, Singapore 266657 




GIVEN NAME/FAMILY NAME 
YanfeiXU 


INVENTOR'S SIGNATURE 


DATE* 


Residence (City, State & Country) _J / 
Chicago, Illinois, USA 
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924 East 57th Street, R122, Chicago, IL 60637, USA 
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Attorney Docket No. 1781-0163P 

I hereby appoint the practitioners at CUSTOMER NO. 2292 as my attorneys or agents to prosecute 
this appucatidn and/or *an international application based on thjs5 application and to transact all 
business in the United States Patent and Trademark Office connected therewith and in connection with 
Sw^dto^atentlSSi on instructions received from the entity who first sent the .application papers 
to the practitioners, unless the inventor(s) or assignee provides said practitioners with a written notice 
to the contrary: 

Send Correspondence to: 

BIRCH, STEWART, KOLASCH & BIRCH, LLP or CUSTOMER NO. 2292 
P.O. Box 747 • Falls Church, Virginia 22040-0747 
Telephone: (703) 205-8000 • Facsimile: (703) 205-8050 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information 
and belief are believed to be true; and further that these statements were made with the knowledge that willful false statements 
and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and 
that such willful false statements may jeopardize the validity of the application or any patent issued thereon. 



GIVEN NAME/ FAMILY NAME 
Zhiyuan GONG 


INVENTOR'S SIGNATURE 


DATE* 


Residence (City, State & Country) 
Singapore, Singapore 


CITIZENSHIP 
Canada 


MAILING ADDRESS (Complete Street Address including City, State & Country) 
113 Clementi Rd. #07-01, Singapore 129793 
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Singapore, Singapore 
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